SUM RULES AND THE SZEGŐ CONDITION FOR
ORTHOGONAL POLYNOMIALS ON THE REAL LINE



BARRY SIMON AND ANDREJ ZLATOŠ

Abstract. We study the Case sum rules, especially $C_0$, for general
Jacobi matrices. We establish situations where the sum rule is valid.
Applications include an extension of Shohat's theorem to cases with an
infinite point spectrum and a proof that if $\lim n(a_n - 1) = \alpha$
and $\lim n b_n = \beta$ exist and $2\alpha < |\beta|$, then the Szegő
condition fails.



1. Introduction 

This paper discusses the relation among three objects well known 
to be in one-one correspondence: nontrivial (i.e., not supported on a 
finite set) probability measures, $\nu$, of bounded support in
$\mathbb{R}$; orthogonal polynomials associated to geometrically
bounded moments; and
bounded Jacobi matrices. One goes from measure to polynomials via 
the Gram-Schmidt procedure, from polynomials to Jacobi matrices by 
the three-term recurrence relation, and from Jacobi matrices to mea- 
sures by the spectral theorem. 

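We will use $J$ to denote the Jacobi matrix ($a_n > 0$)

$$J = \begin{pmatrix} b_1 & a_1 & 0 & \cdots \\ a_1 & b_2 & a_2 & \cdots \\ 0 & a_2 & b_3 & \cdots \\ \vdots & \vdots & \vdots & \ddots \end{pmatrix} \tag{1.1}$$

$\nu$ will normally denote the spectral measure of the vector
$\delta_1 \in \ell^2(\mathbb{Z}_+)$ and $P_n(x)$ the orthonormal polynomials.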

We are interested in $J$'s close to the free Jacobi matrix, $J_0$, with
$b_n = 0$, $a_n = 1$, and
$d\nu_0(E) = (2\pi)^{-1} \chi_{[-2,2]}(E) \sqrt{4 - E^2}\, dE$. Most often,
we will suppose $J - J_0$ is compact. That means
$\sigma_{\mathrm{ess}}(J) = [-2,2]$ and $J$ has only eigenvalues outside
$[-2,2]$, of multiplicity one, denoted $E_j^\pm$ with
$E_1^+ > E_2^+ > \cdots > 2$ and $E_1^- < E_2^- < \cdots < -2$.

One of the main objects of study here is the Szegő integral

$$Z(J) = \frac{1}{2\pi} \int_{-2}^{2} \ln\left( \frac{\sqrt{4 - E^2}}{2\pi\, d\nu_{\mathrm{ac}}/dE} \right) \frac{dE}{\sqrt{4 - E^2}} \tag{1.2}$$

The Szegő integral is often taken in the literature as

$$\int_{-2}^{2} \ln\left( \frac{d\nu_{\mathrm{ac}}}{dE} \right) \frac{dE}{\sqrt{4 - E^2}}$$

which differs from $Z(J)$ by a constant and a critical minus sign (so the
common condition that the Szegő integral not be $-\infty$ becomes
$Z(J) < \infty$ in our normalization). There is an enormous literature
discussing when $Z(J) < \infty$ holds (see, e.g., [15] among others).
It can be shown by Jensen's inequality that $Z(J) \ge -\frac12 \ln(2)$, so
the integral can only diverge to $+\infty$.
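
As a numerical sanity check on the correspondence between the free Jacobi matrix $J_0$ and its spectral measure $d\nu_0$, the following Python sketch compares the moments $\langle \delta_1, J_0^k \delta_1 \rangle$ with the moments of the density $(2\pi)^{-1}\sqrt{4 - E^2}$ on $[-2,2]$. The function names, the truncation size, and the midpoint quadrature are illustrative choices of ours and are not part of the paper.

```python
import math


def free_jacobi_moment(k):
    """Return <delta_1, J_0^k delta_1> for the free Jacobi matrix (b_n = 0, a_n = 1).

    A truncation to k + 2 sites is exact here, because a path of length k
    that starts and ends at site 1 never travels further than k / 2 sites.
    """
    size = k + 2
    vec = [0.0] * size
    vec[0] = 1.0  # the vector delta_1
    for _ in range(k):
        new = [0.0] * size
        for i in range(size):
            if i > 0:
                new[i] += vec[i - 1]  # off-diagonal entry a = 1
            if i + 1 < size:
                new[i] += vec[i + 1]  # off-diagonal entry a = 1
        vec = new
    return vec[0]


def semicircle_moment(k, steps=200_000):
    """Midpoint-rule approximation of the k-th moment of d nu_0 on [-2, 2]."""
    h = 4.0 / steps
    total = 0.0
    for i in range(steps):
        e = -2.0 + (i + 0.5) * h
        total += e**k * math.sqrt(4.0 - e * e)
    return total * h / (2.0 * math.pi)
```

For even $k$ the two computations agree up to quadrature error (the common values are the Catalan numbers), and the odd moments vanish by symmetry.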

We will focus here on various sum rules that are valid. One of our 
main results is the following: 

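Theorem 1. Suppose

$$A_0(J) = \lim_{N\to\infty} \left( -\sum_{n=1}^{N} \ln(a_n) \right) \tag{1.3}$$

exists (although it may be $+\infty$ or $-\infty$). Consider the additional
quantities $Z(J)$ given by (1.2) and

$$\mathcal{E}_0(J) = \sum_{\pm} \sum_{j} \ln\left( \tfrac12 \left[ |E_j^\pm| + \sqrt{(E_j^\pm)^2 - 4} \right] \right) \tag{1.4}$$

If any two of the three quantities $A_0(J)$, $\mathcal{E}_0(J)$, and $Z(J)$
are finite, then all three are, and

$$Z(J) = A_0(J) + \mathcal{E}_0(J) \tag{1.5}$$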
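
To make the eigenvalue term concrete, here is a small Python sketch. It assumes (as in (1.15) and (4.4)) that each eigenvalue $E$ with $|E| > 2$ is written as $E = \beta + \beta^{-1}$ with $|\beta| > 1$ and contributes $\ln|\beta|$ to $\mathcal{E}_0(J)$. The helper names are ours. The sketch also compares this contribution with $\sqrt{E^2 - 4}$, the quantity that enters the summability criterion for $\mathcal{E}_0$.

```python
import math


def beta_from_energy(e):
    """Solve E = beta + 1/beta for the root with |beta| > 1 (needs |E| > 2)."""
    if abs(e) <= 2:
        raise ValueError("eigenvalue must lie outside [-2, 2]")
    root = (abs(e) + math.sqrt(e * e - 4.0)) / 2.0
    return math.copysign(root, e)


def e0_term(e):
    """Contribution ln|beta| of a single eigenvalue E to the eigenvalue sum."""
    return math.log(abs(beta_from_energy(e)))


def sqrt_term(e):
    """The comparison quantity sqrt(E^2 - 4)."""
    return math.sqrt(e * e - 4.0)
```

Writing $|E| = 2\cosh t$ gives $\ln|\beta| = t$ and $\sqrt{E^2 - 4} = 2\sinh t$, so the ratio `e0_term(e) / sqrt_term(e)` tends to $1/2$ as $|E| \downarrow 2$; the two sums therefore converge or diverge together when the eigenvalues accumulate only at $\pm 2$.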
Remarks. 1. It is not hard to see that $\mathcal{E}_0(J) < \infty$ if and
only if

$$\sum_{\pm} \sum_{j} \sqrt{(E_j^\pm)^2 - 4} < \infty \tag{1.6}$$

2. The full theorem (Theorem 4.1) does not require the limit (1.3)
to exist, but is more complicated to state in that case.

3. If the three quantities are finite, many additional sum rules hold.

4. This is what Killip-Simon [10] call the $C_0$ sum rule.

5. Peherstorfer-Yuditskii (see their remark after Lemma 2.1)
prove that if $Z(J) < \infty$, $\mathcal{E}_0(J) = \infty$, then the limit
in (1.3) is also infinite.

Theorem 1 is an analog for the real line of a seventy-year-old theorem
for orthogonal polynomials on the unit circle:

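$$\frac{1}{2\pi} \int_0^{2\pi} \ln\left( \frac{d\nu_{\mathrm{ac}}}{d\theta} \right) d\theta = \sum_{j=0}^{\infty} \ln(1 - |\alpha_j|^2) \tag{1.7}$$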

where $\{\alpha_j\}_{j=0}^\infty$ are the Verblunsky coefficients (also
called reflection, Geronimus, Schur, or Szegő coefficients) of $\nu$. This
result was first proven by Verblunsky [26] in 1935, although it is closely
related to Szegő's 1920 paper.



For $J$'s with $J - J_0$ finite rank (and perhaps even with
$\sum_{n=1}^\infty n(|a_n - 1| + |b_n|) < \infty$), the sum rule (1.5) is
due to Case. Recently, Killip-Simon [10] showed how to exploit these sum
rules as a spectral tool (motivated in turn by work on Schrödinger
operators by Deift-Killip and Denissov). In particular, Killip-Simon
emphasized the importance of proving sum rules on as large a class of
$J$'s as possible.

One application we will make of Theorem 1 and related ideas is to
prove the following (= Theorem 5.2):

Theorem 2. Suppose $\sigma_{\mathrm{ess}}(J) \subset [-2,2]$ and (1.6)
holds. Then $Z(J) < \infty$ if and only if

$$\liminf_{N\to\infty} \left( -\sum_{n=1}^{N} \ln(a_n) \right) < \infty \tag{1.8}$$

Moreover, if these conditions hold, then

(i) The limit $A_0(J)$ in (1.3) exists and is finite.

(ii) $\lim_{N\to\infty} \sum_{n=1}^{N} b_n$ exists and is finite.

(iii)

$$\sum_{n=1}^{\infty} (a_n - 1)^2 + \sum_{n=1}^{\infty} b_n^2 < \infty \tag{1.9}$$

Results of this genre when it is assumed that $\sigma(J) = [-2,2]$ go
back to Shohat [21] with important contributions by Nevai [13]. The
precise form is from Killip-Simon [10]. Nikishin [15] showed how to extend
this to Jacobi matrices with finitely many eigenvalues.
Peherstorfer-Yuditskii proved $Z(J) < \infty$ implies (i) under the
condition $\mathcal{E}_0(J) < \infty$, allowing an infinity of eigenvalues
for the first time. Our result cannot extend to situations with
$\mathcal{E}_0(J) = \infty$ since Theorem 1 says if (i) holds and
$Z(J) < \infty$, then $\mathcal{E}_0(J) < \infty$.

We will highlight one other result we will prove later (Corollary 6.3).

Theorem 3. Let $a_n$, $b_n$ be Jacobi matrix parameters so that

$$\lim_{n\to\infty} n(a_n - 1) = \alpha \qquad \lim_{n\to\infty} n b_n = \beta \tag{1.10}$$

exist and are finite. Suppose that

$$|\beta| > 2\alpha \tag{1.11}$$

Then $Z(J) = \infty$.

Remark. In particular, if $\alpha < 0$, (1.11) always holds. (1.11)
describes three-quarters of the $(2\alpha, \beta)$ plane.

In Section 6, we will discuss the background for this result, and
describe results of Zlatoš [27] that show if $|\beta| < 2\alpha$ and one
has additional information on the approach to the limit (1.10), then
$Z(J) < \infty$. Thus Theorem 3 captures the precise region where one has
(1.10) and one can hope to prove $Z(J) = \infty$.

Theorem 3 will actually follow from a more general result (see
Theorem 4.4 and §3).

Theorem 4. Suppose (1.9) holds and that either
$\limsup_n \bigl( -\sum_{j=1}^{n} (a_j - 1 + \tfrac12 b_j) \bigr) = \infty$ or
$\limsup_n \bigl( -\sum_{j=1}^{n} (a_j - 1 - \tfrac12 b_j) \bigr) = \infty$.
Then $Z(J) = \infty$.
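
The link between the hypotheses of Theorem 4 and the condition $|\beta| > 2\alpha$ of Theorem 3 can be seen on the model sequences $a_j = 1 + \alpha/j$, $b_j = \beta/j$. The Python sketch below evaluates the two partial sums $-\sum_{j \le N} (a_j - 1 \pm \tfrac12 b_j)$; the model sequences and the function name are assumptions made for illustration only.

```python
def theorem4_partial_sums(alpha, beta, n_terms):
    """Partial sums -sum_{j<=N}(a_j - 1 + b_j/2) and -sum_{j<=N}(a_j - 1 - b_j/2)
    for the model sequences a_j = 1 + alpha/j, b_j = beta/j."""
    plus = 0.0
    minus = 0.0
    for j in range(1, n_terms + 1):
        a_minus_one = alpha / j
        b = beta / j
        plus -= a_minus_one + 0.5 * b
        minus -= a_minus_one - 0.5 * b
    return plus, minus
```

Each sum equals $-(\alpha \pm \tfrac12\beta)$ times a harmonic number, so one of them grows without bound exactly when $\alpha + \tfrac12\beta < 0$ or $\alpha - \tfrac12\beta < 0$, that is, when $2\alpha < |\beta|$. For example, `theorem4_partial_sums(0.25, 1.0, N)` has a second component that increases (logarithmically) with `N`, while for `alpha = beta = 1.0` both components are negative.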

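The main technique in this paper exploits the m-function, the Borel
transform of the measure, $\nu$:

$$m_\nu(E) = \int \frac{d\nu(x)}{x - E} \tag{1.12}$$

Since $\nu$ is supported on $[-2,2]$ plus the set of points $\{E_j^\pm\}$,
we can write

$$m_\nu(E) = \int_{-2}^{2} \frac{d\nu(x)}{x - E} + \sum_{j,\pm} \frac{\nu(\{E_j^\pm\})}{E_j^\pm - E} \tag{1.13}$$

It is useful to transfer everything to the unit circle, using the fact
that $z \mapsto E = z + z^{-1}$ maps $\mathbb{D} = \{ z : |z| < 1 \}$ onto
the cut plane $\mathbb{C} \setminus [-2,2]$. Thus we can define for
$|z| < 1$

$$M(z) = -m_\nu(z + z^{-1}) \tag{1.14}$$

The minus sign is picked so $\operatorname{Im} M(z) > 0$ if
$\operatorname{Im} z > 0$. We use $M(z; J)$
when we want to make the $J$-dependence explicit.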
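
For the free matrix $J_0$ the M-function can be computed explicitly, and a short numerical experiment makes the sign convention visible. The sketch below assumes the coefficient-stripping relation $-m(E; J)^{-1} = E - b_1 + a_1^2\, m(E; J^{(1)})$ quoted later in this introduction, applied with $b_1 = 0$, $a_1 = 1$ and $J_0^{(1)} = J_0$, so that $m(E; J_0)$ is a fixed point of $m \mapsto -1/(E + m)$. The function names and the iteration depth are our own choices.

```python
import cmath


def m_free(energy, depth=400):
    """Approximate m(E; J_0) by iterating m -> -1 / (E + m) from m = 0.

    For E = z + 1/z with |z| < 1 the fixed point m = -z is attracting
    (the multiplier there is z**2), so the iteration converges geometrically.
    """
    m = 0j
    for _ in range(depth):
        m = -1.0 / (energy + m)
    return m


def big_m_free(z, depth=400):
    """M(z; J_0) = -m(z + 1/z; J_0), defined for |z| < 1."""
    return -m_free(z + 1.0 / z, depth)
```

With, say, `z = 0.5 * cmath.exp(0.7j)`, the value `big_m_free(z)` agrees with `z` to rounding error, which is the identity $M(z; J_0) = z$, and its imaginary part is positive as the sign convention requires.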

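In (1.13), we translate the pole term directly. Define $\beta_j^\pm$ so

$$E_j^\pm = \beta_j^\pm + (\beta_j^\pm)^{-1} \tag{1.15}$$

with $|\beta_j^\pm| > 1$. We sometimes drop the explicit $\pm$ symbol and
count the $\beta_j$'s in one set. We define a signed measure $d\mu^\#$ on
$(0, 2\pi)$ by
$\operatorname{Im} M(re^{i\theta})\, d\theta \to d\mu^\#(\theta)$ weakly as
$r \uparrow 1$. $\mu^\#$ is positive on $(0, \pi)$ and negative on
$(\pi, 2\pi)$. Thus (1.13) implies

$$\operatorname{Im} M(z) = \operatorname{Im} \sum_j \frac{\mu(\{\beta_j^{-1}\})}{z + z^{-1} - \beta_j - \beta_j^{-1}} + \frac{1}{2\pi} \int_0^{2\pi} P(z, e^{i\varphi})\, d\mu^\#(\varphi) \tag{1.16}$$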

where we use $\mu(\{\beta_j^{-1}\})$ for the weights $\nu(\{E_j\})$ and
$P(z, w)$, with $|z| < 1$, $|w| = 1$, is the Poisson kernel

$$P(z, w) = \frac{1 - |z|^2}{|z - w|^2} \tag{1.17}$$

or

$$P(re^{i\theta}, e^{i\varphi}) \equiv P_r(\theta, \varphi) = \frac{1 - r^2}{1 + r^2 - 2r\cos(\theta - \varphi)} \tag{1.18}$$
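
The two forms of the Poisson kernel, $(1 - |z|^2)/|z - w|^2$ and $(1 - r^2)/(1 + r^2 - 2r\cos(\theta - \varphi))$ for $z = re^{i\theta}$, $w = e^{i\varphi}$, and the normalization $\int_0^{2\pi} P_r\, d\varphi / 2\pi = 1$ used repeatedly below can be checked numerically. The following Python sketch does so; the function names and the midpoint quadrature are illustrative assumptions.

```python
import cmath
import math


def poisson_kernel(z, w):
    """Poisson kernel in complex form, for |z| < 1 and |w| = 1."""
    return (1.0 - abs(z) ** 2) / abs(z - w) ** 2


def poisson_r(r, theta, phi):
    """The same kernel in polar form, with z = r e^{i theta}, w = e^{i phi}."""
    return (1.0 - r * r) / (1.0 + r * r - 2.0 * r * math.cos(theta - phi))


def mean_over_circle(r, theta, steps=20000):
    """Midpoint approximation of (1 / 2 pi) * integral of P_r over phi in [0, 2 pi]."""
    h = 2.0 * math.pi / steps
    total = sum(poisson_r(r, theta, (i + 0.5) * h) for i in range(steps))
    return total * h / (2.0 * math.pi)
```

The agreement of `poisson_kernel` and `poisson_r` is just the identity $|re^{i\theta} - e^{i\varphi}|^2 = 1 + r^2 - 2r\cos(\theta - \varphi)$, and `mean_over_circle` returns 1 up to quadrature error for any $0 \le r < 1$.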

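We note now that since $\mu(\{\beta_j^{-1}\})$ are point masses of a
probability measure, we have

$$\sum_j \mu(\{\beta_j^{-1}\}) \le 1$$

It is useful to use the fact that $\mu^\#$ is odd under reflection to
rewrite (1.16) in the form (with $z = re^{i\theta}$)

$$\operatorname{Im} M(re^{i\theta}) = \operatorname{Im} \sum_j \frac{\mu(\{\beta_j^{-1}\})}{z + z^{-1} - \beta_j - \beta_j^{-1}} + \frac{1}{2\pi} \int_0^{\pi} D_r(\theta, \varphi)\, d\mu(\varphi) \tag{1.19}$$

where

$$D_r(\theta, \varphi) = P_r(\theta, \varphi) - P_r(\theta, -\varphi) \tag{1.20}$$

This is because $\overline{M(z)} = M(\bar z)$, so that
$\mu^\# \restriction [-\pi, 0]$ is a reflection across $\mathbb{R}$ of
$\mu = \mu^\# \restriction [0, \pi]$.

Note that not only does $\operatorname{Im} M(re^{i\theta})\, d\theta$
converge weakly to $d\mu^\#$, but by general principles [20],
$\lim_{r \uparrow 1} M(re^{i\theta}) \equiv M(e^{i\theta})$ exists for
a.e. $\theta$ and
$d\mu^\#_{\mathrm{ac}} = \operatorname{Im} M(e^{i\theta})\, d\theta$.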

Section 2, the technical core of the paper, proves some convergence
results about integrals of $\ln[\operatorname{Im} M(re^{i\theta})]$. It is
precisely such integrals that arise in Section 3 where, following
Killip-Simon [10], we use the well-known

$$-m(z; J)^{-1} = z - b_1 + a_1^2\, m(z; J^{(1)})$$

where $J^{(1)}$ is $J$ with the top row and leftmost column removed. We
will be able to prove sum rules that compare $J$ and $J^{(1)}$. In
Section 4 we will then list various sum rules, including Theorems 1 and 4.
Section 5 proves Theorem 2 and Section 6 discusses Coulomb Jacobi matrices
($J - J_0$ decays as $n^{-1}$) and Theorem 3 in particular.

It is a pleasure to thank Mourad Ismail, Rowan Killip, and Paul 
Nevai for useful discussions. 



2. Continuity of Integrals of ln(ImM) 

In this section, we will prove a general continuity result about
boundary values of interest for M-functions of the type defined in (1.19).
We will consider suitable weight functions, $w(\varphi)$, on $[0, \pi]$, of
which the examples of most interest are $w(\varphi) = \sin^k(\varphi)$,
$k = 0$ or $2$. Our goal is to prove that

$$\lim_{r \uparrow 1} \int \ln[\operatorname{Im} M(re^{i\varphi})]\, w(\varphi)\, d\varphi = \int \ln[\operatorname{Im} M(e^{i\varphi})]\, w(\varphi)\, d\varphi \tag{2.1}$$

and that the convergence is in $L^1$ if the integral on the right is
finite. All integrals in this section are from $0$ to $\pi$ if not
indicated otherwise. We define

$$d(\varphi) = \min(\varphi, \pi - \varphi) \tag{2.2}$$

and we suppose that

$$0 < w(\varphi) \le C_1\, d(\varphi)^{-1+\alpha} \tag{2.3}$$

for some $C_1, \alpha > 0$ and that $w$ is $C^1$ with

$$|w'(\varphi)\, w(\varphi)^{-1}| \le C_2\, d(\varphi)^{-\beta} \tag{2.4}$$

for $C_2, \beta > 0$. For weights of interest, one can take
$\alpha = \beta = 1$.

Remarks. 1. For the applications in mind, we are only interested in
allowing "singularities" (i.e., $w$ vanishing or going to infinity) at $0$
or $\pi$, but all results hold with unchanged proofs if
$d(\varphi) = \min_j \{|\varphi - \varphi_j|\}$ for any finite set
$\{\varphi_j\}$. For example, $w(\varphi) = \sin^2(m\varphi)$ as in [11] is
fine.

2. Note that by (2.3), $\int_0^\pi w(\varphi)\, d\varphi < \infty$.

The main technical result we will need is: 

Theorem 2.1. Let $M$ be a function with a representation of the form
(1.19) and let $w$ be a weight function obeying (2.3) and (2.4). Then
(2.1) holds. Moreover, if

$$\int \ln[\operatorname{Im} M(e^{i\varphi})]\, w(\varphi)\, d\varphi > -\infty \tag{2.5}$$

(it is never $+\infty$), then

$$\lim_{r \uparrow 1} \int \bigl| \ln[\operatorname{Im} M(re^{i\varphi})] - \ln[\operatorname{Im} M(e^{i\varphi})] \bigr|\, w(\varphi)\, d\varphi = 0 \tag{2.6}$$

Let $\ln_\pm$ be defined by

$$\ln_\pm(y) = \max(0, \pm\ln(y))$$

so

$$\ln(y) = \ln_+(y) - \ln_-(y)$$
$$|\ln(y)| = \ln_+(y) + \ln_-(y)$$
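
The positive and negative parts of the logarithm are used throughout this section, so a minimal Python sketch may help fix the conventions. The names `ln_plus` and `ln_minus` are ours; the last property checked in the usage note is the subadditivity bound that appears later in this section as (2.22).

```python
import math


def ln_plus(y):
    """Positive part of the logarithm: max(0, ln y), for y > 0."""
    return max(0.0, math.log(y))


def ln_minus(y):
    """Negative part of the logarithm: max(0, -ln y), for y > 0."""
    return max(0.0, -math.log(y))
```

For every $y > 0$ one has `ln_plus(y) - ln_minus(y) == math.log(y)` and `ln_plus(y) + ln_minus(y) == abs(math.log(y))`. Both parts are nonnegative, `ln_minus` is decreasing in `y`, and `ln_minus(a * b) <= ln_minus(a) + ln_minus(b)` for positive `a`, `b`.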
We will prove Theorem 2.1 by proving

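Theorem 2.2. For any $a > 0$ and $p < \infty$,
$\ln_+[\operatorname{Im} M(e^{i\varphi})/a] \in L^p((0, \pi), w(\varphi)\, d\varphi)$,
and

$$\lim_{r \uparrow 1} \int \left| \ln_+\left( \frac{\operatorname{Im} M(re^{i\varphi})}{a} \right) - \ln_+\left( \frac{\operatorname{Im} M(e^{i\varphi})}{a} \right) \right|^p w(\varphi)\, d\varphi = 0 \tag{2.7}$$

Theorem 2.3. For any $a > 0$, we have

$$\lim_{r \uparrow 1} \int \ln_-\left( \frac{\operatorname{Im} M(re^{i\varphi})}{a} \right) w(\varphi)\, d\varphi = \int \ln_-\left( \frac{\operatorname{Im} M(e^{i\varphi})}{a} \right) w(\varphi)\, d\varphi \tag{2.8}$$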

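Proof of Theorem 2.1 given Theorems 2.2 and 2.3. By Fatou's lemma
and the fact that for a.e. $\varphi$,
$\operatorname{Im} M(re^{i\varphi}) \to \operatorname{Im} M(e^{i\varphi})$,
we have

$$\liminf_{r \uparrow 1} \int \ln_-[\operatorname{Im} M(re^{i\varphi})]\, w(\varphi)\, d\varphi \ge \int \ln_-[\operatorname{Im} M(e^{i\varphi})]\, w(\varphi)\, d\varphi \tag{2.9}$$

Since Theorem 2.2 says that
$\sup_{0<r<1} \int \ln_+[\operatorname{Im} M(re^{i\varphi})]\, w(\varphi)\, d\varphi < \infty$,
it follows that if
$\int \ln_-[\operatorname{Im} M(e^{i\varphi})]\, w(\varphi)\, d\varphi = \infty$,
then (2.1) holds.

If (2.5) holds, then

$$\lim_{a \downarrow 0} \int \ln_-\left( \frac{\operatorname{Im} M(e^{i\varphi})}{a} \right) w(\varphi)\, d\varphi = 0$$

since $\ln_-(y/a)$ is monotone decreasing to $0$ as $a$ decreases. Given
$\varepsilon$, first find $a$ so

$$\int \ln_-\left( \frac{\operatorname{Im} M(e^{i\varphi})}{a} \right) w(\varphi)\, d\varphi < \frac{\varepsilon}{3}$$

and then, by (2.8), $r_1 < 1$ so for $r_1 < r < 1$

$$\int \ln_-\left( \frac{\operatorname{Im} M(re^{i\varphi})}{a} \right) w(\varphi)\, d\varphi < \frac{\varepsilon}{3}$$

By (2.7), find $r_2 < 1$, so for $r_2 < r < 1$,

$$\int \left| \ln_+\left( \frac{\operatorname{Im} M(re^{i\varphi})}{a} \right) - \ln_+\left( \frac{\operatorname{Im} M(e^{i\varphi})}{a} \right) \right| w(\varphi)\, d\varphi < \frac{\varepsilon}{3}$$

Writing

$$|\ln(\alpha) - \ln(\beta)| \le \left| \ln_+\left( \frac{\alpha}{a} \right) - \ln_+\left( \frac{\beta}{a} \right) \right| + \ln_-\left( \frac{\alpha}{a} \right) + \ln_-\left( \frac{\beta}{a} \right)$$

we see that if $\max(r_1, r_2) < r < 1$, then

$$\int \bigl| \ln[\operatorname{Im} M(re^{i\varphi})] - \ln[\operatorname{Im} M(e^{i\varphi})] \bigr|\, w(\varphi)\, d\varphi < \varepsilon$$

so (2.6) holds. □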

We will prove Theorem 2.2 by using the dominated convergence theorem
and standard maximal function techniques. Given the measure $\mu$ on
$(0, \pi)$, we define $\tilde\mu$ to be the sum of $\mu$ and its reflection
on $(\pi, 2\pi)$, and its maximal function by

$$\mu^*(x) = \sup_{0 < a < \pi} \frac{\tilde\mu(x - a, x + a)}{2a}$$

The Hardy-Littlewood maximal inequality for measures (see Rudin [20])
says that

$$|\{x : \mu^*(x) > \lambda\}| \le \frac{3\mu(0, \pi)}{\lambda} \tag{2.10}$$

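Lemma 2.4. Let $M$ be an M-function based on a measure $\mu$ on $[0, \pi]$
and weights at the poles $\{\beta_j^{-1}\}$, and let $a$ be a sum of the
weights of the poles. Then for $0 < r < 1$,

$$\operatorname{Im} M(re^{i\theta}) \le \mu^*(\theta) + a r^{-1} [\sin(\theta)]^{-2} \tag{2.11}$$

Proof. Since $D_r(\theta, \varphi) \le P_r(\theta, \varphi)$ and $P_r$ is a
convolution operator with a positive even decreasing function of $\varphi$
on $[0, \pi]$ with $\int_0^{2\pi} P_r(\varphi)\, d\varphi / 2\pi = 1$, we
have, by standard calculations, that

$$\frac{1}{2\pi} \int_0^{\pi} D_r(\theta, \varphi)\, d\mu(\varphi) \le \mu^*(\theta)$$

On the other hand, for $|\beta| > 1$,

$$\left| \frac{1}{z + z^{-1} - \beta - \beta^{-1}} \right| \le \frac{|z|}{|\operatorname{Im} z|^2} \le \frac{1}{r \sin^2\theta}$$

if $z = re^{i\theta}$, so summing the pole term shows

$$\left| \sum_j \frac{\mu(\{\beta_j^{-1}\})}{z + z^{-1} - \beta_j - \beta_j^{-1}} \right| \le \frac{a}{r \sin^2\theta}$$

□

Proof of Theorem 2.2. Let

$$f_1(\theta) = \mu^*(\theta) \qquad f_2(\theta) = 2[\sin(\theta)]^{-2}$$

For a.e. $\theta$,
$\ln_+[(\operatorname{Im} M(re^{i\theta}))/a] \to \ln_+[(\operatorname{Im} M(e^{i\theta}))/a]$.
By (2.11), for all $\tfrac12 < r < 1$,
$\ln_+[(\operatorname{Im} M(re^{i\theta}))/a] \le \ln_+[(f_1(\theta) + f_2(\theta))/a]$.
Thus if we prove that for all $p < \infty$,

$$\int \left| \ln_+\left( \frac{f_1 + f_2}{a} \right) \right|^p w(\varphi)\, d\varphi < \infty$$

we obtain (2.7) by the dominated convergence theorem. Since

$$|\ln_+(x)|^p \le C(p, q) |x|^q$$

for any $p < \infty$, $q > 0$, and suitable $C(p, q)$ and

$$|x + y|^q \le 2^q |x|^q + 2^q |y|^q$$

it suffices to find some $q > 0$, so

$$\int \bigl( |f_1(\varphi)|^q + |f_2(\varphi)|^q \bigr) w(\varphi)\, d\varphi < \infty$$

Since for $v^{-1} + t^{-1} = 1$,

$$\int |f_i(\varphi)|^q w(\varphi)\, d\varphi \le \left( \int |f_i(\varphi)|^{qv}\, d\varphi \right)^{1/v} \left( \int |w(\varphi)|^t\, d\varphi \right)^{1/t}$$

and $w(\varphi) \in L^t$ for some $t > 1$ by (2.3), it suffices to find some
$s > 0$ with

$$\int \bigl( |f_1(\varphi)|^s + |f_2(\varphi)|^s \bigr)\, d\varphi < \infty \tag{2.12}$$

By (2.10), $\int |f_1(\varphi)|^s\, d\varphi < \infty$ if $s < 1$ and
clearly, $\int |f_2(\varphi)|^s\, d\varphi < \infty$ if $s < \tfrac12$. □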



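As a preliminary to the proof of Theorem 2.3, we need

Lemma 2.5. Let $w$ obey (2.4). Let $0 < \varphi_0 < \pi$ and let
$\varphi_1, \varphi_2 \in [0, \pi]$ obey

(a) $d(\varphi_1) \ge d(\varphi_0)$, $d(\varphi_2) \ge d(\varphi_0)$  (2.13)

(b) $|\varphi_1 - \varphi_2| \le d(\varphi_0)^{\beta}$  (2.14)

Then for $C_3 = C_2 \exp(C_2)$,

$$\left| \frac{w(\varphi_1)}{w(\varphi_2)} - 1 \right| \le C_3\, |\varphi_1 - \varphi_2|\, d(\varphi_0)^{-\beta} \tag{2.15}$$

Proof.

$$\left| \frac{w(\varphi_1)}{w(\varphi_2)} - 1 \right| = \left| \exp\left( \int_{\varphi_2}^{\varphi_1} \frac{w'(\eta)}{w(\eta)}\, d\eta \right) - 1 \right| \le \bigl| \exp\bigl( C_2 |\varphi_2 - \varphi_1|\, d(\varphi_0)^{-\beta} \bigr) - 1 \bigr| \tag{2.16}$$

by (2.4) and (2.13). But $|e^x - 1| \le e^{|x|} |x|$, so by (2.14),

$$\left| \frac{w(\varphi_1)}{w(\varphi_2)} - 1 \right| \le C_2 \exp(C_2)\, |\varphi_1 - \varphi_2|\, d(\varphi_0)^{-\beta}$$

which is (2.15). □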

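We will also need the following pair of lemmas:

Lemma 2.6. Let $0 < \eta < \theta < \pi - \eta$ and

$$N_r(\theta, \eta) = \frac{1}{2\pi} \int_{\theta - \eta}^{\theta + \eta} D_r(\theta, \varphi)\, d\varphi$$

Then

$$0 \le [1 - N_r(\theta, \eta)] \le 2(1 - r) \left[ \frac{1}{r \sin^2\theta} + \frac{1}{r \sin^2\eta} \right] \tag{2.17}$$

Proof. We have

$$\frac{1}{2\pi} \int_0^{2\pi} P_r(\theta, \varphi)\, d\varphi = 1$$

so since $D_r \le P_r$, $N_r \le 1$ and

$$1 - N_r(\theta, \eta) \le \frac{1}{2\pi} \int_{\varphi \in [0, \pi]} P_r(\theta, -\varphi)\, d\varphi + \frac{1}{2\pi} \int_{|\theta - \varphi| \ge \eta} P_r(\theta, \varphi)\, d\varphi \tag{2.18}$$

Now

$$P_r(\theta, \varphi) = \frac{1 - r^2}{(1 - r)^2 + 4r \sin^2[\tfrac12(\theta - \varphi)]} \le \frac{2(1 - r)}{4r \sin^2[\tfrac12(\theta - \varphi)]} \le \frac{2(1 - r)}{r \sin^2(\theta - \varphi)} \tag{2.19}$$

The first integrand in (2.18) is thus bounded by
$2r^{-1}(1 - r)[\sin^2(\theta)]^{-1}$ and the second by
$2r^{-1}(1 - r)[\sin^2(\eta)]^{-1}$, so (2.17) is immediate. □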

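Lemma 2.7. If $\int_0^{\pi} \operatorname{Im} M(e^{i\theta})\, d\theta \ne 0$,
then for $\theta \in [0, \pi]$, $r \in (\tfrac12, 1)$,

$$\operatorname{Im} M(re^{i\theta}) \ge c\,(r^{-1} - r) \sin\theta \tag{2.20}$$

Proof. In terms of the real line m-function, for $E_2 > 0$, $E_1$ real,

$$\operatorname{Im}[-m(E_1 - iE_2)] \ge \frac{E_2}{\pi} \int_{-2}^{2} \frac{\operatorname{Im} m(E)\, dE}{(E - E_1)^2 + E_2^2} \tag{2.21}$$

since we have dropped the positive contributions of $\nu_{\mathrm{sing}}$
to $\operatorname{Im}(-m)$. Now if $z = re^{i\theta}$,

$$M(z) = -m(E_1 - iE_2)$$

where $z + z^{-1} = E_1 - iE_2$ or $E_1 = (r + r^{-1}) \cos\theta$,
$E_2 = (r^{-1} - r) \sin\theta$. If $r > \tfrac12$, then
$|E_1| \le \tfrac52$, $|E_2| \le \tfrac32$, and in (2.21), $|E| \le 2$. Thus

$$\operatorname{Im} M(z) \ge c\, E_2(z)$$

which is (2.20). □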

Proof of Theorem 2.3. Since $\ln_-$ is a decreasing function, to get
upper bounds on $\ln_-[\operatorname{Im} M(re^{i\theta})/a]$, we can use a
lower bound on $\operatorname{Im} M$. The elementary bound

$$\ln_-(ab) \le \ln_-(a) + \ln_-(b) \tag{2.22}$$

will be useful.

As already noted, Fatou's lemma implies the liminf of the left side
of (2.8) is bounded from below by the right side, so it suffices to prove
that

$$\limsup_{r \uparrow 1} \int_0^{\pi} \ln_-\left( \frac{\operatorname{Im} M(re^{i\varphi})}{a} \right) w(\varphi)\, d\varphi \le \int_0^{\pi} \ln_-\left( \frac{\operatorname{Im} M(e^{i\varphi})}{a} \right) w(\varphi)\, d\varphi \tag{2.23}$$

Pick $\gamma$ and $\kappa$ so $0 < \max(\beta, 1)\gamma < \kappa < \tfrac12$
and let $\theta_0(r) = (1 - r)^{\gamma}$, $\eta(r) = (1 - r)^{\kappa}$. We
will bound $\operatorname{Im} M(re^{i\theta})$ from below for
$d(\theta) \le \theta_0(r)$ using (2.20), and for
$d(\theta) \ge \theta_0(r)$, we will use the Poisson integral for the
region $|\varphi - \theta| \le \eta(r)$.

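By (2.20) and (2.22),

$$\int_{d(\varphi) \le \theta_0(r)} \ln_-\left( \frac{\operatorname{Im} M(re^{i\varphi})}{a} \right) w(\varphi)\, d\varphi \le C_a\, \theta_0^{\alpha}\, [\ln_-(r^{-1} - r) + \ln_- \theta_0]$$

which goes to zero as $r \uparrow 1$ for any $a$. So suppose
$d(\theta) \ge \theta_0$. Write

$$\operatorname{Im} M(re^{i\theta}) \ge \int_{\theta - \eta(r)}^{\theta + \eta(r)} D_r(\theta, \varphi)\, \operatorname{Im} M(e^{i\varphi})\, \frac{d\varphi}{2\pi} \tag{2.24}$$

For later purposes, note that for $d(\theta) \ge \theta_0$, (2.17) implies

$$0 \le 1 - N_r(\theta, \eta) \le C (1 - r)^{1 - 2\kappa} \tag{2.25}$$

which goes to zero since $\kappa < \tfrac12$. Using (2.24) and (2.22), we
bound $\ln_-[\operatorname{Im} M(re^{i\theta})/a]$ as two $\ln_-$'s. Since
$\ln_-$ is convex and
$[D_r(\theta, \varphi) / 2\pi N_r(\theta, \eta)]\, \chi_{(\theta - \eta, \theta + \eta)}(\varphi)\, d\varphi$
is a probability measure, we can use Jensen's inequality to see that

$$w(\theta) \ln_-\left[ \frac{\operatorname{Im} M(re^{i\theta})}{a} \right] \le w(\theta) \ln_-[N_r(\theta, \eta)] + \int_{\theta - \eta(r)}^{\theta + \eta(r)} \frac{w(\theta)}{w(\varphi)}\, \frac{D_r(\theta, \varphi)}{N_r(\theta, \eta)}\, w(\varphi) \ln_-\left[ \frac{\operatorname{Im} M(e^{i\varphi})}{a} \right] \frac{d\varphi}{2\pi} \tag{2.26}$$

In the first term for the $\theta$'s with $d(\theta) \ge \theta_0(r)$,
$N_r$ obeys (2.25) so

$$\int_{d(\theta) \ge \theta_0(r)} w(\theta) \ln_-[N_r(\theta, \eta)]\, d\theta = O((1 - r)^{1 - 2\kappa}) \tag{2.27}$$

In the second term, note that for the $\theta$'s in question,
$N_r(\theta, \eta)^{-1} - 1 = O((1 - r)^{1 - 2\kappa})$ and by (2.15),
$w(\theta)/w(\varphi) - 1 = O((1 - r)^{\kappa - \beta\gamma})$. Since
$D_r(\theta, \varphi) \le P_r(\theta, \varphi)$, we thus have

$$\int_{d(\theta) \ge \theta_0} \ln_-\left[ \frac{\operatorname{Im} M(re^{i\theta})}{a} \right] w(\theta)\, d\theta \le O((1 - r)^{1 - 2\kappa}) + [1 + O((1 - r)^{1 - 2\kappa})][1 + O((1 - r)^{\kappa - \beta\gamma})] \iint_{\substack{d(\theta) \ge \theta_0 \\ |\varphi - \theta| \le \eta}} P_r(\theta, \varphi)\, w(\varphi) \ln_-\left[ \frac{\operatorname{Im} M(e^{i\varphi})}{a} \right] \frac{d\varphi\, d\theta}{2\pi} \tag{2.28}$$

Since the integrand is positive, we can extend it to
$\{(\theta, \varphi) \mid \theta \in [0, 2\pi], \varphi \in [0, \pi]\}$ and
do the $\theta$ integration using
$\int P_r(\theta, \varphi)\, d\theta / 2\pi = 1$. The result is (2.23). □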

This concludes the proof of Theorem 2.1. By going through the
proof, one easily sees that

Theorem 2.8. Theorem 2.1 remains true if in (2.1) and (2.6),
$\ln[\operatorname{Im} M(re^{i\varphi})]$ is replaced by
$\ln[g(r) \sin\varphi + \operatorname{Im} M(re^{i\varphi})]$ where
$g(r) > 0$ and $g(r) \to 0$ as $r \uparrow 1$.

Proof. In the $\ln_+$ bounds, we get an extra
$[\sup_{\frac12 < r < 1} g(r)] \sin\theta$ in $f_1(\theta)$. Since we still
have pointwise convergence, we easily get the analog of Theorem 2.2. In
the proof of Theorem 2.3, Fatou is unchanged since $g(r) \to 0$, and since

$$\ln_-\bigl( g(r) \sin\varphi + \operatorname{Im} M(re^{i\varphi}) \bigr) \le \ln_-\bigl( \operatorname{Im} M(re^{i\varphi}) \bigr)$$

the lim sup bound has an unchanged proof. □

3. The Step-by-Step Sum Rules

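We will call $J$ a BW matrix (for Blumenthal-Weyl) if $J$ is a bounded
Jacobi matrix with $\sigma_{\mathrm{ess}}(J) = [-2, 2]$, for example, if
$J - J_0$ is compact. Let $J^{(n)}$ be the matrix resulting from removing
the first $n$ rows and columns. Let $\{E_j^\pm(J)\}_{j=1}^{\infty}$ be the
eigenvalues of $J$ above/below $\pm 2$, ordered by
$\pm E_1^\pm > \pm E_2^\pm > \cdots$ with $E_j^\pm(J)$ defined to be
$\pm 2$ if there are only finitely many eigenvalues $k < j$ above/below
$\pm 2$. Then by the min-max principle,

$$\pm E_{j+n}^\pm(J) \le \pm E_j^\pm(J^{(n)}) \le \pm E_j^\pm(J) \tag{3.1}$$

We have $\lim_{j \to \infty} E_j^\pm(J) = \pm 2$ if $J$ is a BW matrix.

It follows by the convergence of sums of alternating series that if $f$
is even or odd and monotone on $[2, \infty)$ with $f(2) = 0$, then

$$\delta f_n(J) = \lim_{N \to \infty} \sum_{\pm} \sum_{j=1}^{N} \bigl[ f(E_j^\pm(J)) - f(E_j^\pm(J^{(n)})) \bigr] \tag{3.2}$$

exists and is finite. If $\beta_j^\pm$ is defined by
$E_j^\pm = \beta_j^\pm + (\beta_j^\pm)^{-1}$ with $|\beta_j| > 1$, we define
$X_\ell^{(n)}(J)$ as $\delta f_n(J)$ for

$$f(E) = \begin{cases} \ln|\beta| & \ell = 0 \\ -\frac{1}{\ell} [\beta^{\ell} - \beta^{-\ell}] & \ell = 1, 2, \dots \end{cases} \tag{3.3}$$

In addition, we will need

$$C_\ell^{(n)}(J) = \begin{cases} -\sum_{j=1}^{n} \ln(a_j) & \ell = 0 \\ \frac{2}{\ell} \lim_{m \to \infty} \bigl[ \operatorname{Tr}(T_\ell(\tfrac12 J_{m;F})) - \operatorname{Tr}(T_\ell(\tfrac12 J^{(n)}_{m-n;F})) \bigr] & \ell = 1, 2, \dots \end{cases} \tag{3.4}$$

where $J_{m;F}$ is the finite matrix formed from the first $m$ rows and
columns of $J$ and $T_\ell$ is the $\ell$-th Chebyshev polynomial (of the
first kind). As noted in [10, Proposition 4.3], the limit in (3.4) exists
since the expression is independent of $m$ once $m > \ell + n$.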

n 

Ci (n) C/) = I> (3-5) 

3=1 
n 

d n) u) = E^ 2 + K 2 - 1 ) (3-6) 



as computed in |L0 
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Note that by construction (with = J), 



n-l 



xf ) (J) = E X i 1) ( j0 ' ) ) ( 3 - 7 ) 

3=0 



and 



n-l 



cfV) = Ed 1 V°' ) ) (3.8) 



j=0 
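
The closed forms for $C_1^{(n)}$ and $C_2^{(n)}$ can be checked against the trace definition on a small example. The Python sketch below assumes that (3.4) reads $C_\ell^{(n)}(J) = \frac{2}{\ell} \lim_m [\operatorname{Tr} T_\ell(\tfrac12 J_{m;F}) - \operatorname{Tr} T_\ell(\tfrac12 J^{(n)}_{m-n;F})]$, with $T_1(x) = x$ and $T_2(x) = 2x^2 - 1$, and compares it with $\sum_{j \le n} b_j$ and $\sum_{j \le n} [\tfrac12 b_j^2 + (a_j^2 - 1)]$. The helper names and the 0-based lists `a`, `b` (so `a[0]` is $a_1$) are our own conventions.

```python
def jacobi_block(a, b, start, stop):
    """Finite Jacobi matrix built from rows and columns start+1..stop of J."""
    n = stop - start
    mat = [[0.0] * n for _ in range(n)]
    for i in range(n):
        mat[i][i] = b[start + i]
        if i + 1 < n:
            mat[i][i + 1] = mat[i + 1][i] = a[start + i]
    return mat


def trace_t1_half(mat):
    """Tr T_1(mat / 2) with T_1(x) = x."""
    return 0.5 * sum(mat[i][i] for i in range(len(mat)))


def trace_t2_half(mat):
    """Tr T_2(mat / 2) with T_2(x) = 2 x^2 - 1, i.e. Tr(mat^2) / 2 - dim."""
    n = len(mat)
    tr_sq = sum(mat[i][k] * mat[k][i] for i in range(n) for k in range(n))
    return 0.5 * tr_sq - n


def c_via_traces(ell, a, b, n, m):
    """(2 / ell) [Tr T_ell(J_{m;F} / 2) - Tr T_ell(J^{(n)}_{m-n;F} / 2)], ell = 1 or 2."""
    trace = trace_t1_half if ell == 1 else trace_t2_half
    full = jacobi_block(a, b, 0, m)
    stripped = jacobi_block(a, b, n, m)
    return (2.0 / ell) * (trace(full) - trace(stripped))


def c1_closed_form(b, n):
    return sum(b[j] for j in range(n))


def c2_closed_form(a, b, n):
    return sum(0.5 * b[j] ** 2 + (a[j] ** 2 - 1.0) for j in range(n))
```

In this sketch the trace difference does not depend on `m` once `m` exceeds `n` by a couple of sites, in line with the remark following (3.4) that the expression stabilizes for large $m$.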



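As final objects we need

$$Z(J) = \frac{1}{4\pi} \int_0^{2\pi} \ln\left( \frac{\sin\theta}{\operatorname{Im} M(e^{i\theta}, J)} \right) d\theta \tag{3.9}$$

and for $\ell \ge 1$,

$$Z_\ell^\pm(J) = \frac{1}{4\pi} \int_0^{2\pi} \ln\left( \frac{\sin\theta}{\operatorname{Im} M(e^{i\theta}, J)} \right) (1 \pm \cos(\ell\theta))\, d\theta \tag{3.10}$$

$$Y_\ell(J) = -\frac{1}{2\pi} \int_0^{2\pi} \ln\left( \frac{\sin\theta}{\operatorname{Im} M(e^{i\theta}, J)} \right) \cos(\ell\theta)\, d\theta \tag{3.11}$$

We include "$\sin\theta$" inside $\ln(\dots)$ so that
$Z(J_0) = Z_\ell^\pm(J_0) = Y_\ell(J_0) = 0$ because $M(z, J_0) = z$.
Of course,

$$Z_\ell^\pm(J) = Z(J) \mp \tfrac12 Y_\ell(J) \tag{3.12}$$

when all integrals converge. By Theorem 2.2, the $\ln_-$ piece of the
integrals in (3.9)-(3.11) always converges. Since
$1 \pm \cos(\ell\theta) \ge 0$, the integrals defining $Z(J)$,
$Z_\ell^\pm(J)$ either converge or diverge to $+\infty$. We therefore
always define $Z(J)$ and $Z_\ell^\pm(J)$ although they may take the value
$+\infty$. Since $|1 \pm \cos(\ell\theta)| \le 2$, $Z(J) < \infty$ implies
$Z_\ell^\pm(J) < \infty$, so we define $Y_\ell(J)$ by (3.12) if and only if
$Z(J) < \infty$.

If $Z(J) < \infty$, we say $J$ obeys the Szegő condition or $J$ is Szegő.
If $Z_1^\pm(J) < \infty$, we say $J$ is Szegő at $\pm 2$ since, for
example, if $Z_1^+(J) < \infty$, the integral in (3.9) converges near
$\theta = 0$ ($E = 2\cos(\theta)$ near $+2$) and if $Z_1^-(J) < \infty$,
the integral converges near $\theta = \pi$ (i.e., $E = -2$). Note that
while $Z_1^+(J) < \infty$ only implies convergence of (3.9) at
$\theta = 0$, it also implies that at $\theta = \pi$ the integral with a
$\sin^2$ inserted converges (quasi-Szegő condition).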

Our main goal in this section is to prove the next three theorems.

Theorem 3.1 (Step-by-Step Sum Rules). Let $J$ be a BW matrix.
$Z(J) < \infty$ if and only if $Z(J^{(1)}) < \infty$, and if
$Z(J) < \infty$, we have

$$Z(J) = -\ln(a_1) + X_0^{(1)}(J) + Z(J^{(1)}) \tag{3.13}$$

$$Y_\ell(J) = C_\ell^{(1)}(J) + X_\ell^{(1)}(J) + Y_\ell(J^{(1)}); \qquad \ell = 1, 2, 3, \dots \tag{3.14}$$

Remarks. 1. By iteration and (3.7)/(3.8), we obtain if $Z(J) < \infty$,
then $Z(J^{(n)}) < \infty$ and

$$Z(J) = -\sum_{j=1}^{n} \ln(a_j) + X_0^{(n)}(J) + Z(J^{(n)}) \tag{3.15}$$

$$Y_\ell(J) = C_\ell^{(n)}(J) + X_\ell^{(n)}(J) + Y_\ell(J^{(n)}); \qquad \ell = 1, 2, 3, \dots \tag{3.16}$$

2. We call (3.13)/(3.14) the step-by-step Case sum rules.

Theorem 3.2 (One-Sided Step-by-Step Sum Rules). Let $J$ be a BW
matrix. $Z_1^\pm(J) < \infty$ if and only if $Z_1^\pm(J^{(1)}) < \infty$,
and if $Z_1^\pm(J) < \infty$, then we have for $\ell = 1, 3, 5, \dots$,

$$Z_\ell^\pm(J) = -\ln(a_1) \mp \tfrac12 C_\ell^{(1)}(J) + X_0^{(1)}(J) \mp \tfrac12 X_\ell^{(1)}(J) + Z_\ell^\pm(J^{(1)}) \tag{3.17}$$

Remark. Theorem 3.2 is intended to be two statements: one with all
the upper signs used and one with all the lower signs used.

Theorem 3.3 (Quasi-Step-by-Step Sum Rules). Let $J$ be a BW matrix.
$Z_2^-(J) < \infty$ if and only if $Z_2^-(J^{(1)}) < \infty$, and if
$Z_2^-(J) < \infty$, then for $\ell = 2, 4, \dots$, we have

$$Z_\ell^-(J) = -\ln(a_1) + \tfrac12 C_\ell^{(1)}(J) + X_0^{(1)}(J) + \tfrac12 X_\ell^{(1)}(J) + Z_\ell^-(J^{(1)}) \tag{3.18}$$

Remarks. 1. The name comes from the fact that since
$1 - \cos 2\theta = 2\sin^2\theta$, $Z_2^-(J)$ is what Killip-Simon [10]
called the quasi-Szegő integral

$$Z_2^-(J) = \frac{1}{2\pi} \int_0^{2\pi} \ln\left( \frac{\sin\theta}{\operatorname{Im} M(e^{i\theta}, J)} \right) \sin^2\theta\, d\theta \tag{3.19}$$

2. Since $Z(J) < \infty$ implies $Z_1^+(J)$ and $Z_1^-(J) < \infty$, and
$Z_1^+(J)$ or $Z_1^-(J) < \infty$ imply $Z_2^-(J) < \infty$, we have
additional sum rules in various cases.

3. In [11], Laptev et al. prove sum rules for $Z_\ell^-(J)$ where
$\ell = 4, 6, 8, \dots$. One can develop step-by-step sum rules in this
case and use it to streamline the proof of their rules as we streamline
the proof of the Killip-Simon $P_2$ rule (our $Z_2^-$ sum rule) in the
next section.

The step-by-step sum rules were introduced in Killip-Simon, who
first take $r < 1$ (in our language below), then take $n \to \infty$, and
then $r \uparrow 1$ with some technical hurdles to take $r \uparrow 1$. By
first letting $r \uparrow 1$ with $n < \infty$, and then $n \to \infty$ as
in the next section, we can both simplify their proof and obtain
additional results. The idea of using the imaginary part of

$$-M(z; J)^{-1} = -(z + z^{-1}) + b_1 + a_1^2 M(z; J^{(1)}) \tag{3.20}$$

is taken from Killip-Simon [10].

Proof of Theorem 3.1. Taking imaginary parts of both sides of (3.20)
with $z = re^{i\theta}$ and $r < 1$, we obtain

$$[\operatorname{Im} M(re^{i\theta}; J)]\, |M(re^{i\theta}; J)|^{-2} = (r^{-1} - r) \sin\theta + a_1^2 \operatorname{Im} M(re^{i\theta}; J^{(1)}) \tag{3.21}$$
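
The identity (3.21) is easy to test numerically when $J$ differs from $J_0$ in only finitely many coefficients. The Python sketch below assumes $M(z; J_0) = z$ and builds $M(z; J)$ by applying the stripping relation $-M(z; J)^{-1} = -(z + z^{-1}) + b_1 + a_1^2 M(z; J^{(1)})$ backward, one coefficient pair at a time. The function name `big_m` and the list conventions (`a[0]` is $a_1$, `b[0]` is $b_1$) are ours.

```python
import cmath
import math


def big_m(z, a, b):
    """M(z; J) for |z| < 1, where J has the listed a_j, b_j and is free afterwards."""
    m = z  # M(z; J_0) = z for the free tail
    for a_j, b_j in reversed(list(zip(a, b))):
        # Solve -1/M(J) = -(z + 1/z) + b_1 + a_1^2 M(J^{(1)}) for M(J).
        m = -1.0 / (-(z + 1.0 / z) + b_j + a_j * a_j * m)
    return m


def check_321(r, theta, a, b):
    """Return (left side, right side) of the imaginary-part identity (3.21)."""
    z = r * cmath.exp(1j * theta)
    m = big_m(z, a, b)
    m1 = big_m(z, a[1:], b[1:])  # M-function of the once-stripped matrix
    lhs = m.imag / abs(m) ** 2
    rhs = (1.0 / r - r) * math.sin(theta) + a[0] ** 2 * m1.imag
    return lhs, rhs
```

For instance, `check_321(0.7, 1.1, [1.3, 0.8], [0.5, -0.2])` returns two numbers that agree to rounding error. The check rests on $\operatorname{Im}(-1/M) = \operatorname{Im} M / |M|^2$ and $\operatorname{Im}(-(z + z^{-1})) = (r^{-1} - r)\sin\theta$.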



Taking ln's of both sides, we obtain

$$\ln\left( \frac{\sin\theta}{\operatorname{Im} M(re^{i\theta}; J)} \right) = t_1 + t_2 + t_3 \tag{3.22}$$

where

$$t_1 = -2 \ln|M(re^{i\theta}; J)| \tag{3.23}$$

$$t_2 = -2 \ln a_1 \tag{3.24}$$

$$t_3 = \ln\left( \frac{\sin\theta}{g(r) \sin\theta + \operatorname{Im} M(re^{i\theta}; J^{(1)})} \right) \tag{3.25}$$

where

$$g(r) = a_1^{-2} (r^{-1} - r) \tag{3.26}$$

Let

$$f(z) = \frac{M(rz; J)}{rz}$$

so $f(0) = 1$ (see (3.20)). In the unit disk, $f(z)$ is meromorphic
and has poles at
$\{(r\beta_j^\pm(J))^{-1} \mid j \text{ so that } |\beta_j^\pm(J)| > r^{-1}\}$
and zeros at
$\{(r\beta_j^\pm(J^{(1)}))^{-1} \mid j \text{ so that } |\beta_j^\pm(J^{(1)})| > r^{-1}\}$.
Thus, by Jensen's formula for $f$:

$$\frac{1}{4\pi} \int_0^{2\pi} t_1\, d\theta = -\ln r + \sum_{|\beta_j^\pm(J)| > r^{-1}} \ln|r\beta_j^\pm(J)| - \sum_{|\beta_j^\pm(J^{(1)})| > r^{-1}} \ln|r\beta_j^\pm(J^{(1)})|$$

By (3.1), the number of terms in the sums differs by at most 2, so that
the $\ln(r^{-1})$'s cancel up to at most $2\ln(r^{-1}) \to 0$ as
$r \uparrow 1$. Thus as $r \uparrow 1$,

$$\frac{1}{4\pi} \int_0^{2\pi} (t_1 + t_2)\, d\theta \to -\ln(a_1) + X_0^{(1)}(J) \tag{3.27}$$

It follows by (3.22) and Theorems 2.1 and 2.8 (with $w(\varphi) = 1$) that
$Z(J) < \infty$ if and only if $Z(J^{(1)}) < \infty$, and if they are
finite, (3.13) holds.

It also follows that if $Z(J) < \infty$, we have $L^1$ convergence of the
ln's to their $r = 1$ values. That implies convergence of the integrals
with $\cos(\ell\theta)$ inside. Higher Jensen's formula as in [10] then
implies (3.14). In place of $\ln|r\beta|$, we have
$(r\beta)^{\ell} - (r\beta)^{-\ell}$, but the sums still converge to the
$r = 1$ limit since we can separate the $\beta^{\ell}$ and
$\beta^{-\ell}$ terms, and then the $r$'s factor out. □



Proofs of Theorems 3.2 and 3.3. These are the same as the above proof,
but now the weight $w$ is either $1 \pm \cos(\theta)$ or
$1 - \cos(2\theta)$ and that weight obeys (2.3) and (2.4). □

Corollary 3.4. Let $J$ be a BW matrix. If $J$ and $\tilde J$ differ by a
finite rank perturbation, then $J$ is Szegő (resp. Szegő at $\pm 2$) if
and only if $\tilde J$ is.

Proof. For some $n$, $J^{(n)} = \tilde J^{(n)}$, so this is immediate
from Theorems 3.1 and 3.2. □



Conjecture 3.5. Let $J$ be a BW matrix. If $J$ and $\tilde J$ differ by
a trace class perturbation, then $J$ is Szegő (resp. Szegő at $\pm 2$) if
and only if $\tilde J$ is. It is possible this conjecture is only
generally true if $J - J_0$ is only assumed compact or is only assumed
Hilbert-Schmidt.

This conjecture for $\tilde J = J_0$ is Nevai's conjecture recently
proven by Killip-Simon. Their method of proof and the ideas here would
prove this conjecture if one can prove a result of the following form.
Let $J$, $\tilde J$ differ by a finite rank operator so that by the
discussion before (3.2),

$$\delta(J, \tilde J) = \lim_{N \to \infty} \sum_{\pm} \sum_{j=1}^{N} \left( \sqrt{E_j^\pm(J)^2 - 4} - \sqrt{E_j^\pm(\tilde J)^2 - 4} \right)$$

exists and is finite. The conjecture would be provable by the method
of [10] and this paper (by using the step-by-step sum rule to remove
the first $m$ pieces of $J$ and then replacing them with the first $m$
pieces of $\tilde J$) if one had a bound of the form

$$|\delta(J, \tilde J)| \le (\text{const.}) \operatorname{Tr}(|J - \tilde J|) \tag{3.28}$$



(3.28) with $\tilde J = J_0$ is the estimate of Hundertmark-Simon. We
have counterexamples that show (3.28) does not hold for a universal
constant $c$. However, in these examples, $\|J\| \to \infty$ as
$c \to \infty$. Thus it could be that (3.28) holds with $c$ only depending
on $J$ for some class of $J$'s. If it held with a bound depending only on
$\|J\|$, the conjecture would hold in general. If $J$ was required in
$J_0$ + Hilbert-Schmidt, we would get the conjecture for such $J$'s.

4. The $Z$, $Z_1^\pm$, and $Z_2^-$ Sum Rules

Our goal here is to prove that sum rules of Case type hold under
certain hypotheses. Of interest on their own, these considerations also
somewhat simplify the proof of the $P_2$ sum rule in Section 8 of [10],
and considerably simplify the proof of the $C_0$ sum rule for trace class
$J - J_0$ from Section 9 of [10]. Throughout, $J$ will be a BW matrix.
There are two main tools. As in [10], lower semicontinuity of the $Z$'s in
$J$ (in the topology of pointwise convergence of matrix elements) gets
inequalities in one direction. We use step-by-step sum rules and
boundedness from below of $Z$ for the other direction.

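We first introduce some quantities involving a fixed Jacobi matrix:

$$\bar A_0(J) = \limsup_{n \to \infty} \left( -\sum_{j=1}^{n} \ln(a_j) \right) \qquad \underline{A}_0(J) = \liminf_{n \to \infty} \left( -\sum_{j=1}^{n} \ln(a_j) \right) \tag{4.1}$$

$$\bar A_1^\pm(J) = \limsup_{n \to \infty} \left( -\sum_{j=1}^{n} (a_j - 1 \pm \tfrac12 b_j) \right) \qquad \underline{A}_1^\pm(J) = \liminf_{n \to \infty} \left( -\sum_{j=1}^{n} (a_j - 1 \pm \tfrac12 b_j) \right) \tag{4.2}$$

$$A_2(J) = \sum_{j=1}^{\infty} \bigl[ \tfrac14 b_j^2 + \tfrac12 G(a_j) \bigr] \tag{4.3}$$

where

$$G(a) = a^2 - 1 - \ln(a^2)$$

Since $G(a) \ge 0$, the finite sums have a limit (which may be $+\infty$).

We note that for $a$ near 1, $G(a) \sim 2(a - 1)^2$. Thus $A_2(J)$ is
finite if and only if $J - J_0$ is Hilbert-Schmidt. In (4.2), we can use
$a_j - 1$ in place of $\ln(a_j)$ because if $\{a_j - 1\} \in \ell^2$
(e.g., if $J - J_0$ is Hilbert-Schmidt), then
$\sum |\ln(a_j) - (a_j - 1)| < \infty$.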
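
The function $G(a) = a^2 - 1 - \ln(a^2)$ and its quadratic behavior near $a = 1$ can be checked directly. The following Python sketch is a minimal illustration; the function name `G` simply mirrors the notation above.

```python
import math


def G(a):
    """G(a) = a^2 - 1 - ln(a^2) for a > 0; nonnegative, with a double zero at a = 1."""
    return a * a - 1.0 - math.log(a * a)
```

Expanding around $a = 1 + x$ gives $G(a) = 2x^2 - \tfrac23 x^3 + O(x^4)$, so `G(a) / (2 * (a - 1) ** 2)` is close to 1 for `a` near 1. This is why $\sum_j G(a_j) < \infty$ is equivalent to $\sum_j (a_j - 1)^2 < \infty$ when $a_j \to 1$.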

We also introduce some functions of the eigenvalues:

$$\mathcal{E}_0(J) = \sum_{j,\pm} \ln|\beta_j^\pm| \tag{4.4}$$

$$\mathcal{E}_1^\pm(J) = \sum_{j} \sqrt{(E_j^\pm)^2 - 4} \tag{4.5}$$

$$\mathcal{E}_2(J) = \sum_{j,\pm} F(E_j^\pm) \tag{4.6}$$

where $F(E) = \tfrac14 [\beta^2 - \beta^{-2} - \ln(\beta^4)]$ with
$E = \beta + \beta^{-1}$ and $|\beta| > 1$. For $|E| \sim 2$, $F(E)$ is
$O((|E| - 2)^{3/2})$. In (4.4) and (4.6), we sum over $+$ and $-$. In
(4.5), we define $\mathcal{E}_1^+$ and $\mathcal{E}_1^-$ with only the $+$
or only the $-$ terms.
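
The order $(|E| - 2)^{3/2}$ of $F$ near the band edges can be confirmed numerically. The Python sketch below assumes $F(E) = \tfrac14[\beta^2 - \beta^{-2} - \ln(\beta^4)]$ with $E = \beta + \beta^{-1}$, $|\beta| > 1$; since $F$ depends only on $\beta^2$, the code works with $|\beta|$. The function name is ours.

```python
import math


def F(e):
    """F(E) = (beta^2 - beta^-2 - ln(beta^4)) / 4 with E = beta + 1/beta, |beta| > 1."""
    if abs(e) <= 2:
        raise ValueError("eigenvalue must lie outside [-2, 2]")
    beta = (abs(e) + math.sqrt(e * e - 4.0)) / 2.0  # this is |beta|
    return 0.25 * (beta**2 - beta**-2 - math.log(beta**4))
```

With $|\beta| = e^t$ one finds $F = \tfrac12[\sinh(2t) - 2t] \approx \tfrac23 t^3$ and $|E| - 2 = 2\cosh t - 2 \approx t^2$, so `F(e) / (abs(e) - 2) ** 1.5` approaches $2/3$ as $|E| \downarrow 2$.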

We need the following basis-dependent notion:

Definition. Let $B$ be a bounded operator on $\ell^2(\mathbb{Z}_+)$. We
say $B$ has a conditional trace if

$$\lim_{\ell \to \infty} \sum_{j=1}^{\ell} \langle \delta_j, B\delta_j \rangle \equiv \operatorname{c-Tr}(B) \tag{4.7}$$

exists and is finite.

If $B$ is not trace class, this object is not unitarily invariant.

Our goal in this section is to prove the following theorems, whose
proofs are deferred until after all the statements.

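Theorem 4.1. Let $J$ be a BW matrix. Consider the four statements:

(i) $\bar A_0(J) > -\infty$

(ii) $\underline{A}_0(J) < \infty$

(iii) $Z(J) < \infty$

(iv) $\mathcal{E}_0(J) < \infty$

Then

(a) (ii) + (iv) $\Rightarrow$ (iii) + (i)

(b) (i) + (iii) $\Rightarrow$ (iv) + (ii)

(c) (iii) $\Rightarrow$ $\bar A_0(J) < \infty$

(d) (iv) $\Rightarrow$ $\underline{A}_0(J) > -\infty$

Thus (iii) + (iv) $\Rightarrow$ (i) + (ii). In particular, if
$\bar A_0(J) = \underline{A}_0(J)$, that is, the limit exists, then the
finiteness of any two of $Z(J)$, $\mathcal{E}_0(J)$, and $A_0(J)$ implies
the finiteness of the third.

If all four conditions hold and $J - J_0$ is compact, then

(e)

$$\lim_{n \to \infty} \left( -\sum_{j=1}^{n} \ln(a_j) \right) = A_0(J) \tag{4.8}$$

exists and is finite, and

$$Z(J) = A_0(J) + \mathcal{E}_0(J) \tag{4.9}$$

(f) For each $\ell = 1, 2, \dots$,

$$-\sum_{j,\pm} \ell^{-1} \bigl[ \beta_j^\pm(J)^{\ell} - \beta_j^\pm(J)^{-\ell} \bigr] \equiv X_\ell^{(\infty)}(J) \tag{4.10}$$

converges absolutely and equals $\lim_{n \to \infty} X_\ell^{(n)}(J)$.

(g) For each $\ell = 1, 2, \dots$,

$$B_\ell(J) = \frac{2}{\ell} \left\{ T_\ell\left( \frac{J}{2} \right) - T_\ell\left( \frac{J_0}{2} \right) \right\} \tag{4.11}$$

has a conditional trace and

$$\operatorname{c-Tr}(B_\ell(J)) = \lim_{n \to \infty} C_\ell^{(n)}(J) \tag{4.12}$$

for example, if $\ell = 1$, $\sum_{j=1}^{n} b_j$ converges to a finite
limit.

(h) The Case sum rule holds:

$$Y_\ell(J) = \operatorname{c-Tr}(B_\ell(J)) + X_\ell^{(\infty)}(J) \tag{4.13}$$

where $Y_\ell$ is given by (3.11), $X_\ell^{(\infty)}$ by (4.10), and
$\operatorname{c-Tr}(B_\ell(J))$ by (4.7), (4.11), and (4.12).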

Remarks. 1. In one sense, this is the main result of this paper.

2. We will give examples later where $\bar A_0(J) = \underline{A}_0(J)$
and one of the conditions (i)/(ii), (iii), (iv) holds and the other two
fail.

3. For $\ell$ odd, $T_\ell(J_0/2)$ vanishes on-diagonal. By Proposition 2.2
of [10] and the fact that the diagonal matrix elements of $J_0^k$ are
eventually constant, it follows that for $\ell$ even, $T_\ell(J_0/2)$
eventually vanishes on-diagonal and
$\operatorname{c-Tr}(T_\ell(J_0/2)) = -\tfrac12$. Thus (g) says
$\operatorname{c-Tr}(T_\ell(J/2))$ exists and the sum rule (4.13) can
replace $\operatorname{c-Tr}(B_\ell(J))$ by
$\operatorname{c-Tr}(T_\ell(J/2))$ plus a constant (zero if $\ell$ is odd
and $1/\ell$ if $\ell$ is even). For $\ell$ even,
$\operatorname{c-Tr}(T_\ell(J_0/2)) = -\tfrac12$ while
$\operatorname{Tr}(T_\ell(J_{0,n;F}/2)) = -1$ for $n$ large because
$T_\ell(J_{0,n;F}/2)$ has two ends.

Corollary 4.2. Let $J - J_0$ be compact. If $Z(J) < \infty$, then
$-\sum_{j=1}^{n} \ln(a_j)$ either converges or diverges to $-\infty$.

Remarks. 1. We will give an example later where $Z(J) < \infty$, and
$\lim_{n \to \infty} (-\sum_{j=1}^{n} \ln(a_j)) = -\infty$.

2. In other words, if $J - J_0$ is compact and
$\bar A_0(J) \ne \underline{A}_0(J)$, then $Z(J) = \infty$.

3. Similarly, if $J - J_0$ is compact and $\mathcal{E}_0(J) < \infty$,
then the limit exists and is finite or is $+\infty$.

Proof. If $Z(J) < \infty$ and $\bar A_0 > -\infty$, then by (b) of the
theorem, all four conditions hold, and so by (e), the limit exists. On the
other hand, if $\bar A_0 = -\infty$, then
$\bar A_0 = \underline{A}_0 = -\infty$. □

Corollary 4.3. If $J - J_0$ is trace class, then $Z(J) < \infty$,
$\mathcal{E}_0(J) < \infty$, and the sum rules (4.9) and (4.13) hold.

Remark. This is a result of Killip-Simon [10]. Our proof that
$Z(J) < \infty$ is essentially the same as theirs, but our proof of the
sum rules is much easier.

Proof. Since $J - J_0$ is trace class, it is compact. Clearly,
$\bar A_0 = \underline{A}_0$, and is neither $\infty$ nor $-\infty$ since
$a_j > 0$ and $\sum |a_j - 1| < \infty$ imply
$\sum |\ln(a_j)| < \infty$. By the bound of Hundertmark-Simon,
$\mathcal{E}_0(J) < \infty$. The sum rules then hold by (a), (e), and (h)
of Theorem 4.1. □



Theorem 4.4. Suppose $J - J_0$ is Hilbert-Schmidt. Then

(i) $\underline A_1^\pm < \infty$ and $\mathcal E_1^\pm < \infty$ implies $Z_1^\pm < \infty$.

(ii) $Z_1^\pm < \infty$ implies $\bar A_1^\pm < \infty$.

(iii) $Z_1^\pm < \infty$ and $\bar A_1^\pm > -\infty$ implies $\mathcal E_1^\pm < \infty$.

(iv) $\mathcal E_1^\pm < \infty$ implies $\underline A_1^\pm > -\infty$.

Remarks. 1. Each of (i)-(iv) is intended as two statements, one for each choice of sign.

2. In Section 6 we will explore (ii), which is the most striking of
these results since its contrapositive gives very general conditions under
which the Szego condition fails.

3. The Hilbert-Schmidt condition in (i) and (iv) can be replaced by
the somewhat weaker condition that

$$\sum_{j,\pm} \bigl(|E_j^\pm| - 2\bigr)^{3/2} < \infty \tag{4.14}$$

That is true for (ii) and (iii) also, but by the $Z_2^-$ sum rule, (4.14) plus
$Z_1^\pm < \infty$ implies $J - J_0$ is Hilbert-Schmidt.

Theorem 4.5. Let $J$ be a BW matrix. Then

$$Z_2^-(J) + \mathcal E_2(J) = A_2(J) \tag{4.15}$$

Remarks. 1. This is, of course, the $P_2$ sum rule of Killip-Simon [10].
Our proof that $Z_2^-(J) + \mathcal E_2(J) \le A_2(J)$ is identical to that in [10], but
our proof of the other half is somewhat streamlined.

2. As in [10], the values $+\infty$ are allowed in (4.15).



Proof of Theorem 4.1. As in [10], let $J_n$ be the infinite Jacobi matrix
obtained from $J$ by replacing $a_\ell$ by 1 if $\ell > n$ and $b_\ell$ by 0 if $\ell \ge n + 1$.
Then (3.15) (noting $(J_n)^{(n)} = J_0$ and $Z(J_0) = 0$) reads

$$Z(J_n) = -\sum_{j=1}^n \ln(a_j) + \sum_{j,\pm} \ln|\beta_j^\pm(J_n)| \tag{4.16}$$

[10, Section 6] implies the eigenvalue sum converges to $\mathcal E_0(J)$ if $J - J_0$
is compact, and in any event, is bounded above by $\mathcal E_0(J) + c_0$ where
$c_0 = 0$ if $J - J_0$ is compact and otherwise,

$$c_0 = \ln|\beta_1^+(J)| + \ln|\beta_1^-(J)| \tag{4.17}$$

Moreover, by semicontinuity of the entropy [10, Section 5], $Z(J) \le \liminf Z(J_n)$.
Taking the liminf in (4.16) along $n$, we thus have

$$Z(J) \le \underline A_0(J) + \mathcal E_0(J) + c_0 \tag{4.18}$$



Thus far, the proof is directly from [10]. On the other hand, by
(3.15), we have

$$Z(J) \ge \bar A_0(J) + \liminf X_0^{(n)}(J) + \liminf Z(J^{(n)}) \tag{4.19}$$

By the lemma below, $\lim_{n\to\infty} X_0^{(n)}(J) = \mathcal E_0(J)$. Moreover, by Theorem 5.5
(eqn. (5.26)) of Killip-Simon [10], $Z(J^{(n)}) \ge -\frac12 \ln(2)$, and if
$J^{(n)} \to J_0$ in norm, that is, $J - J_0$ is compact, then by semicontinuity
of $Z$, $0 = Z(J_0) \le \liminf Z(J^{(n)})$. Therefore, (4.19) implies that

$$Z(J) \ge \bar A_0(J) + \mathcal E_0(J) - c \tag{4.20}$$

where

$$c = 0 \text{ if } J - J_0 \text{ is compact}; \qquad c = \tfrac12 \ln 2 \text{ in general} \tag{4.21}$$

With these preliminaries out of the way,

Proof of (d). (iv) and (4.18) imply that

$$\bar A_0(J) \ge \underline A_0(J) \ge Z(J) - \mathcal E_0(J) - c_0 > -\infty \tag{4.22}$$

Proof of (a). (4.18) shows $Z(J) < \infty$, and (d) shows that (i) holds.

Proof of (c). By (4.20) and $\mathcal E_0(J) \ge 0$,

$$Z(J) \ge \bar A_0(J) - c$$

so $Z(J) < \infty$ implies $\bar A_0(J) < \infty$.

Proof of (b). Since $\bar A_0(J) > -\infty$ and $c < \infty$, (4.20) plus $Z(J) < \infty$
implies $\mathcal E_0(J) < \infty$. (c) shows that (ii) holds.

Note that (iii), (iv), and (4.20) imply that

$$\underline A_0(J) \le \bar A_0(J) \le Z(J) - \mathcal E_0(J) + \tfrac12 \ln 2 < \infty \tag{4.23}$$

Thus we have shown more than merely (iii) + (iv) $\Rightarrow$ (i) + (ii), namely,
(iii) + (iv) imply by (4.22) and (4.23)

$$-\infty < \bar A_0(J) \le \underline A_0(J) + \tfrac12 \ln 2 + c_0 < \infty \tag{4.24}$$

We can say more if $J - J_0$ is compact.

Proof of (e). (4.23) is now replaced by

$$\underline A_0(J) \le \bar A_0(J) \le Z(J) - \mathcal E_0(J)$$

since we can take $c = 0$ in (4.20). This plus (4.22) with $c_0 = 0$ implies
$\underline A_0(J) = \bar A_0(J)$ and (4.9).
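
The step in the proof of (e) can be written as a single chain. This is only a restatement of (4.18) and (4.20), assuming $J - J_0$ is compact (so that $c_0 = c = 0$) and that $Z(J)$ and $\mathcal E_0(J)$ are finite:

```latex
Z(J) \;\le\; \underline{A}_0(J) + \mathcal{E}_0(J)
     \;\le\; \bar{A}_0(J) + \mathcal{E}_0(J)
     \;\le\; Z(J)
```

The first inequality is (4.18), the second is $\underline A_0 \le \bar A_0$, and the third is (4.20). Since the two ends coincide and are finite, every inequality is an equality, which gives both $\underline A_0(J) = \bar A_0(J)$ and the sum rule $Z(J) = A_0(J) + \mathcal E_0(J)$.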

Proof of (f), (g), (h). We have the sum rules (3.15), (3.16). $Z(J) \pm Y_\ell(J)$
is an entropy up to a constant, and so, lower semicontinuous. Since
$\|J^{(n)} - J_0\| \to 0$, we have

$$\liminf \bigl(Z(J^{(n)}) \pm Y_\ell(J^{(n)})\bigr) \ge 0 \tag{4.25}$$

On the other hand, since $Z(J^{(n)}) < \infty$ and $\mathcal E_0(J^{(n)}) \le \mathcal E_0(J) < \infty$, $J^{(n)}$
obeys the sum rule (4.9). Since $-\sum_{j=1}^n \ln(a_j)$ converges conditionally,

$$\lim_{n\to\infty} \sup_m \Bigl| -\sum_{j=n}^{m+n} \ln(a_j) \Bigr| = 0$$

Moreover, $\mathcal E_0(J^{(n)}) \to 0$ by Lemma 4.6 below and we conclude that
$\lim Z(J^{(n)}) = 0$. Thus (4.25) becomes

$$\liminf Y_\ell(J^{(n)}) \ge 0, \qquad \limsup Y_\ell(J^{(n)}) \le 0$$

or

$$\lim_n Y_\ell(J^{(n)}) = 0 \tag{4.26}$$

By the lemma below, $\lim_n X_\ell^{(n)}(J) = X_\ell^{(\infty)}(J)$ exists and is finite.
Since $\mathcal E_0(J) < \infty$, we have that the sum defining $X_\ell^{(\infty)}(J)$ is absolutely
convergent. This proves (f).

By this fact, (3.16), and (4.26), $\lim_{n\to\infty} \zeta_\ell^{(n)}(J)$ exists, is finite, and
obeys the sum rule

$$Y_\ell(J) = \lim_{n\to\infty} \zeta_\ell^{(n)}(J) + X_\ell^{(\infty)}(J)$$

By Propositions 2.2 and 4.3 of Killip-Simon [10], the existence of
$\lim_{n\to\infty} \zeta_\ell^{(n)}(J)$ is precisely the existence of the conditional trace. □

Lemma 4.6. Let $J$ be a BW matrix. Let $f$ be a monotone increasing
continuous function on $[2, \infty)$ with $f(2) = 0$. Then

$$\lim_{n\to\infty} \sum_{j=1}^\infty \bigl[f(E_j^+(J)) - f(E_j^+(J^{(n)}))\bigr] = \sum_{j=1}^\infty f(E_j^+(J)) \tag{4.27}$$

Remarks. 1. The right side of (4.27) may be finite or infinite.

2. The sum on the left is interpreted as the limit of the sum from
1 to $m$ as $m \to \infty$, which exists and is finite by the arguments at the
start of Section 3.

3. A similar result holds for $E_j^-$ and $f$ monotone decreasing on
$(-\infty, -2]$.

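Proof. Call the sum on the left of (4.27) $(\delta f)(J, n)$. Since $E_j^+(J^{(n)}) \le E_j^+(J)$,
every term of that sum is nonnegative, and so for each $m$ we have

$$(\delta f)(J, n) \ge \sum_{j=1}^m \bigl[f(E_j^+(J)) - f(E_j^+(J^{(n)}))\bigr] \tag{4.28}$$

so, if we show for each fixed $j$ as $n \to \infty$,

$$E_j^+(J^{(n)}) \to 2 \tag{4.29}$$

we have, by taking $n \to \infty$ and then $m \to \infty$, that

$$\liminf (\delta f)(J, n) \ge \sum_{j=1}^\infty f(E_j^+(J)) \tag{4.30}$$

On the other hand, since $f \ge 0$, for each $m$,

$$\sum_{j=1}^m \bigl[f(E_j^+(J)) - f(E_j^+(J^{(n)}))\bigr] \le \sum_{j=1}^m f(E_j^+(J))$$

so taking $m$ to infinity and then $n \to \infty$,

$$\limsup (\delta f)(J, n) \le \sum_{j=1}^\infty f(E_j^+(J)) \tag{4.31}$$

Thus (4.29) implies the result, so we need only prove that.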

Fix $\varepsilon > 0$ and look at the solution of the orthogonal polynomial
sequence $u_n = P_n(2 + \varepsilon)$ as a function of $n$. By Sturm oscillation
theory, the number of sign changes of $u_n$ (i.e., number of zeros
of the piecewise linear interpolation of $u_n$) is the number of $j$ with
$E_j^+(J) > 2 + \varepsilon$. Since $J$ is a BW matrix, this is finite, so there exists $N_0$
with $u_n$ of definite sign if $n > N_0 - 1$. It follows by Sturm oscillation
theory again that for all $j$,

$$E_j^+(J^{(n)}) \le 2 + \varepsilon$$

if $n > N_0$. This implies (4.29). □

The combination of this Sturm oscillation argument and Theorem 3.1
gives one tools to handle finitely many bound states as an alternate to
Nikishin [15]. For the oscillation argument says that if $J$ has finitely
many eigenvalues outside $[-2, 2]$, there is a $J^{(n)}$ with no eigenvalues. On
the other hand, by Theorem 3.1, $Z(J) < \infty$ if and only if $Z(J^{(n)}) < \infty$.
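
The sign-change count used in the oscillation argument is easy to try numerically. The sketch below is illustrative only: the function names are hypothetical, and the test matrix ($a_n = 1$, $b_1 = 2$, all other $b_n = 0$) is an assumed example, not one taken from the text. For that rank-one perturbation the single eigenvalue above 2 is $b_1 + 1/b_1 = 2.5$ (a standard fact assumed here), so one expects one sign change of $P_n(x)$ for $2 < x < 2.5$ and none for $x > 2.5$.

```python
def orthonormal_values(a, b, x):
    """Return [P_0(x), ..., P_n(x)] from the three-term recurrence
        x P_k = a_{k+1} P_{k+1} + b_{k+1} P_k + a_k P_{k-1},
    with P_{-1} = 0 and P_0 = 1.
    a[k] and b[k] hold a_{k+1} and b_{k+1} (0-based lists of equal length)."""
    values = [1.0]
    prev, cur = 0.0, 1.0
    for k in range(len(a)):
        # For k = 0 the coefficient a_0 multiplies P_{-1} = 0, so any value works.
        a_prev = a[k - 1] if k > 0 else 0.0
        nxt = ((x - b[k]) * cur - a_prev * prev) / a[k]
        values.append(nxt)
        prev, cur = cur, nxt
    return values


def sign_changes(values):
    """Count sign changes in a sequence, skipping exact zeros (a simplification)."""
    signs = [v > 0 for v in values if v != 0.0]
    return sum(1 for s, t in zip(signs, signs[1:]) if s != t)


# Assumed example: free a_n = 1, with b_1 = 2 and all other b_n = 0.
n = 40
a = [1.0] * n
b = [2.0] + [0.0] * (n - 1)
below = sign_changes(orthonormal_values(a, b, 2.25))  # x below the eigenvalue 2.5
above = sign_changes(orthonormal_values(a, b, 2.6))   # x above the eigenvalue 2.5
```

Only finitely many terms are computed, so this checks the count for a truncation; the lemma itself concerns the full half-line matrix.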

Proof of Theorem 4.5. $Z_2^-(J)$ is an entropy and not merely an entropy
up to a constant (see [10]). Thus $Z_2^-(J^{(n)}) \ge 0$ for all $J^{(n)}$. Moreover,
since the terms in $A_2$ are positive, the limit exists. Thus, following the
proofs of (4.18) and (4.20) but using the corresponding step-by-step sum rule
for $Z_2^-$ in place of (3.15),

$$Z_2^-(J) + \mathcal E_2(J) \le A_2(J)$$

and

$$Z_2^-(J) + \mathcal E_2(J) \ge A_2(J)$$

which yields the $P_2$ sum rule. In the above, we use the fact that in place
of $Z(J) \ge -\frac12 \ln(2)$, one has $Z_2^-(J) \ge 0$, and the fact that $A_2(J) < \infty$
implies that $J - J_0$ is compact. □

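Proof of Theorem 4.4. Let $g(\beta) = \ln\beta - \frac12(\beta - \beta^{-1})$ in the region $\beta > 0$.
Then

$$g'(\beta) = \beta^{-1} - \tfrac12 - \tfrac12\beta^{-2} = -\tfrac12\beta^{-2}(\beta - 1)^2$$

so $g$ is analytic near $\beta = 1$ and $g(1) = g'(1) = g''(1) = 0$, that is,
$g(\beta) \sim c(\beta - 1)^3$. On the other hand, $h(\beta) = \ln\beta + \frac12(\beta - \beta^{-1})$ is
$g(\beta) + (\beta - \beta^{-1}) = \beta - \beta^{-1} + O((\beta - 1)^3)$. Since $\beta + \beta^{-1} = E$ means
$\beta - \beta^{-1} = \sqrt{E^2 - 4}$ and $\beta - 1 = O(\sqrt{E - 2})$, we conclude that

$$E > 2 \;\Rightarrow\; \ln(\beta) - \tfrac12(\beta - \beta^{-1}) = O(|E - 2|^{3/2})$$
$$\phantom{E > 2 \;\Rightarrow\;} \ln(\beta) + \tfrac12(\beta - \beta^{-1}) = \sqrt{E^2 - 4} + O(|E - 2|^{3/2})$$

while

$$E < -2 \;\Rightarrow\; \ln(|\beta|) - \tfrac12(\beta - \beta^{-1}) = \sqrt{E^2 - 4} + O(|E + 2|^{3/2})$$
$$\phantom{E < -2 \;\Rightarrow\;} \ln(|\beta|) + \tfrac12(\beta - \beta^{-1}) = O(|E + 2|^{3/2})$$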
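
The edge asymptotics above can be checked numerically. The following is a minimal sketch with hypothetical helper names; it takes $\beta$ to be the root of $\beta + \beta^{-1} = E$ with $|\beta| > 1$ and looks only at the case $E > 2$. Since $g'(\beta) = -\frac12\beta^{-2}(\beta - 1)^2$ and $\beta - 1 \approx \sqrt{E - 2}$, one expects $g(\beta)/|E - 2|^{3/2}$ to approach $-\frac16$ as $E \downarrow 2$.

```python
import math


def beta_from_energy(E):
    """Root of beta + 1/beta = E with |beta| > 1 (assumes |E| > 2)."""
    root = math.sqrt(E * E - 4.0)
    return (E + root) / 2.0 if E > 0 else (E - root) / 2.0


def g(beta):
    """g(beta) = ln(beta) - (beta - 1/beta)/2, defined for beta > 0."""
    return math.log(beta) - 0.5 * (beta - 1.0 / beta)


def h(beta):
    """h(beta) = ln(beta) + (beta - 1/beta)/2, defined for beta > 0."""
    return math.log(beta) + 0.5 * (beta - 1.0 / beta)


def scaled_errors(delta):
    """For E = 2 + delta, return the two error terms divided by delta**1.5:
    g(beta) and h(beta) - sqrt(E^2 - 4)."""
    E = 2.0 + delta
    beta = beta_from_energy(E)
    scale = delta ** 1.5
    return g(beta) / scale, (h(beta) - math.sqrt(E * E - 4.0)) / scale


ratios = [scaled_errors(10.0 ** (-k)) for k in range(1, 5)]
```

Both scaled quantities stay bounded as `delta` shrinks, which is what the $O(|E - 2|^{3/2})$ statements assert; they agree with each other because $h(\beta) - \sqrt{E^2 - 4} = g(\beta)$ exactly.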
It follows, using Lemma 4.6, that

$$\lim_{n\to\infty} \bigl[X_0^{(n)}(J) \pm \tfrac12 X_1^{(n)}(J)\bigr] = \mathcal E_1^\pm + \text{bdd}$$

since Theorem 4.5 implies $\sum_{j,\pm} \bigl(\sqrt{(E_j^\pm)^2 - 4}\,\bigr)^3 < \infty$ (or, by results of
Hundertmark-Simon). Thus for a constant $c_1$ depending only on $\|J - J_0\|_2$, we have

$$Z_1^\pm(J) \le c_1 + \underline A_1^\pm + \mathcal E_1^\pm \tag{4.32}$$

by writing the finite rank sum rule, taking limits and using the argument
between (4.16) and (4.17). Since $Z_1^\pm(J)$ are entropies up to a
constant, we have $Z_1^\pm(J^{(n)}) \ge -c_2$ and so by (3.17),

$$Z_1^\pm(J) \ge -c_2 + \bar A_1^\pm + \mathcal E_1^\pm - c\|J - J_0\|_2^2 \tag{4.33}$$

With these preliminaries, we have

Proof of (i), (iv). Immediate from (4.32).

Proof of (ii). Since $\mathcal E_1^\pm \ge 0$, (4.33) implies

$$Z_1^\pm(J) \ge -c_2 + \bar A_1^\pm$$

so (ii) holds.

Proof of (iii). Immediate from (4.33). □



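Remark. (i)-(iv) of Theorem 4.4 are exactly (a)-(d) of Theorem 4.1
for the $Z_1^\pm$ sum rules. One therefore expects a version of (e) of that
theorem to hold as well. Indeed, a modification of the above proof
yields for $J - J_0$ Hilbert-Schmidt that if $\mathcal E_1^+$, $Z_1^+$, $\bar A_1^+$ are finite, then

$$Z_1^+(J) = -\sum_{n=1}^\infty \bigl[\ln(a_n) + \tfrac12 b_n\bigr] + \sum_{j,\pm} \bigl[\ln|\beta_j^\pm| + \tfrac12\bigl(\beta_j^\pm - (\beta_j^\pm)^{-1}\bigr)\bigr]$$

and if $\mathcal E_1^-$, $Z_1^-$, $\bar A_1^-$ are finite, then

$$Z_1^-(J) = -\sum_{n=1}^\infty \bigl[\ln(a_n) - \tfrac12 b_n\bigr] + \sum_{j,\pm} \bigl[\ln|\beta_j^\pm| - \tfrac12\bigl(\beta_j^\pm - (\beta_j^\pm)^{-1}\bigr)\bigr]$$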



5. Shohat's Theorem with an Eigenvalue Estimate 



Shohat [21] translated Szego's theory from the unit circle to the real
line and was able to identify all Jacobi matrices which lead to measures
with no mass points outside $[-2, 2]$ and have $Z(J) < \infty$. The strongest
result we know of this type is the following (Theorem 4') from Killip-Simon
[10] (the methods of Nevai can prove the same result):

Theorem 5.1. Let $\sigma(J) \subset [-2, 2]$. Consider

(i) $\underline A_0(J) < \infty$ where $\underline A_0$ is given by (4.1).

(ii) $Z(J) < \infty$

(iii) $\sum_{n=1}^\infty (a_n - 1)^2 + \sum_{n=1}^\infty b_n^2 < \infty$

(iv) $\bar A_0 = \underline A_0$ and is finite.

(v) $\lim_{N\to\infty} \sum_{n=1}^N b_n$ exists and is finite.

Then (under $\sigma(J) \subset [-2, 2]$), we have

(i) $\Leftrightarrow$ (ii)

and either one implies (iii), (iv), and (v).
We can prove the following extension of this result: 



Theorem 5.2. Theorem 5.1 remains true if $\sigma(J) \subset [-2, 2]$ is replaced
by $\sigma_{\rm ess}(J) \subset [-2, 2]$ and the condition $\mathcal E_0(J) < \infty$.

Remarks. 1. Goncar, Nevai, and Nikishin [15] extended
Shohat-type theorems to allow finitely many bound states outside $[-2, 2]$.

2. Peherstorfer-Yuditskii recently proved that $\mathcal E_0(J) < \infty$ and
(ii) implies (iv) and additional results on polynomial asymptotics.

Proof. Let us suppose first $\sigma_{\rm ess}(J) = [-2, 2]$, so $J$ is a BW matrix. By
Theorem 4.1(a), (i) of this theorem plus $\mathcal E_0(J) < \infty$ implies (ii) of this
theorem. By Theorem 4.1(c), (ii) of this theorem implies (i) of this
theorem.

If either holds, then (iv) follows from (e) of Theorem 4.1, (v) from the
$\ell = 1$ case of (g) of Theorem 4.1. (iii) follows from Theorem 4.5 if we
note that $\mathcal E_0 < \infty$ implies $\mathcal E_2 < \infty$, that $Z(J) < \infty$ implies $Z_2^-(J) < \infty$,
and that $G(a) = O((a - 1)^2)$.

If we only have a priori that $\sigma_{\rm ess}(J) \subset [-2, 2]$, we proceed as follows.
If $Z(J) < \infty$, $\sigma_{\rm ac}(J) \supset [-2, 2]$ so, in fact, $\sigma_{\rm ess}(J) = [-2, 2]$. If $\underline A_0 < \infty$,
we look closely at the proof of Theorem 4.1(a). (4.18) does not require
$\sigma_{\rm ess}(J) = [-2, 2]$, but only that $\sigma_{\rm ess}(J) \subset [-2, 2]$. Thus, $\underline A_0 < \infty$
implies $Z(J) < \infty$ if $\mathcal E_0(J) < \infty$. □

There is an interesting way of rephrasing this. Let the normalized
orthogonal polynomial obey

$$P_n(x) = \gamma_n x^n + O(x^{n-1}) \tag{5.1}$$

As is well known (see, e.g., [22]),

$$\gamma_n = (a_1 a_2 \cdots a_n)^{-1} \tag{5.2}$$

Thus

$$\underline A_0 = \liminf \ln(\gamma_n) \tag{5.3}$$

and

$$\bar A_0 = \limsup \ln(\gamma_n) \tag{5.4}$$
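
Relation (5.2) says that $\ln(\gamma_n) = -\sum_{j=1}^n \ln(a_j)$, so the partial sums whose liminf and limsup define $\underline A_0$ and $\bar A_0$ are just running logarithms of the leading coefficients. A minimal sketch, with a hypothetical function name and the assumed test sequence $a_n = 1 + \frac1n$ (whose product telescopes to $n + 1$):

```python
import math


def log_leading_coefficients(a):
    """Given [a_1, ..., a_n] with all a_j > 0, return [ln(gamma_1), ..., ln(gamma_n)],
    where gamma_k = 1 / (a_1 * ... * a_k) as in (5.2)."""
    running = 0.0
    out = []
    for a_j in a:
        running -= math.log(a_j)  # ln(gamma_k) = -sum_{j <= k} ln(a_j)
        out.append(running)
    return out


# Assumed example: a_n = 1 + 1/n, so a_1 * ... * a_N = N + 1 and ln(gamma_N) = -ln(N + 1).
N = 999
logs = log_leading_coefficients([1.0 + 1.0 / n for n in range(1, N + 1)])
```

In this example $\ln(\gamma_N)$ drifts to $-\infty$ logarithmically, so $\gamma_N \to 0$.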

Corollary 5.3. Suppose $\sigma_{\rm ess}(J) \subset [-2, 2]$ and $\mathcal E_0(J) < \infty$. Then
$Z(J) < \infty$ (i.e., the Szego condition holds) if and only if $\gamma_n$ is bounded
from above (and in that case, it is also bounded away from 0; indeed,
$\lim \gamma_n$ exists and is in $(0, \infty)$).

Remark. Actually, $\limsup \gamma_n < \infty$ is not needed; $\liminf \gamma_n < \infty$ is
enough.

Proof. By (5.3), $\gamma_n$ bounded implies $\underline A_0 < \infty$, and thus $Z(J) < \infty$.
Conversely, $Z(J) < \infty$ implies $-\infty < \underline A_0 = \bar A_0 < \infty$. So by (5.2), it
implies $\gamma_n$ is bounded above and below. □

In the case of orthogonal polynomials on the circle, Szego's theorem
says $Z < \infty$ if and only if $\kappa_j$ is bounded if and only if $\sum_{j=1}^\infty |\alpha_j|^2 < \infty$
where $\kappa_j$ is the leading coefficient of the normalized polynomials, and $\alpha_j$
are the Verblunsky (aka Geronimus, aka reflection) coefficients. In the
real line case, if one drops the a priori requirement that $\mathcal E_0(J) < \infty$,
it can happen that $\gamma_n$ is bounded but $Z(J) = \infty$. For example, if
$a_n = 1$ but $b_n = n^{-1}$, then $Z(J)$ cannot be finite. For $J - J_0 \in \ell^2$, so
Theorem 4.4(ii) is applicable and thus, $\bar A_1^- = \infty$ implies $Z(J) = \infty$.

But the other direction always holds:

Theorem 5.4. Let $J$ be a BW matrix with $Z(J) < \infty$ (i.e., the Szego
condition holds). Then $\gamma_n$ is bounded. Moreover, if $J - J_0$ is compact,
then $\lim_{n\to\infty} \gamma_n$ exists.

Remarks. 1. The examples of the next section show $Z(J) < \infty$ is
consistent with $\lim \gamma_n = 0$.

2. This result, even without a compactness hypothesis, is known.
For $\gamma_n$ is monotone increasing in the measure (see, e.g., Nevai [14]) and
so one can reduce to the case where Shohat's theorem applies.

Proof. By Theorem 4.1(c), $Z(J) < \infty$ implies $\bar A_0 < \infty$ which, by (5.4),
implies $\gamma_n$ is bounded. If $J - J_0$ is compact, then Corollary 4.2 implies
that $\lim \gamma_n = \exp\bigl(\lim -\sum_{j=1}^n \ln(a_j)\bigr)$ exists but can be zero. □



Here is another interesting application of Theorem 5.2:

Theorem 5.5. Suppose $b_n \ge 0$ and

$$\sum_{n=1}^\infty |a_n - 1| < \infty \tag{5.5}$$

Then $\mathcal E_0(J) < \infty$ if and only if $\sum_{n=1}^\infty b_n < \infty$.

Proof. If $\sum_{n=1}^\infty b_n < \infty$, $\mathcal E_0(J) < \infty$ by (5.5) and the bounds of
Hundertmark-Simon. On the other hand, if $\mathcal E_0(J) < \infty$, (5.5) implies $\underline A_0 < \infty$,
so by Theorem 5.2, $\sum_{n=1}^N b_n$ is convergent. Since $b_n \ge 0$, $\sum_{n=1}^\infty b_n < \infty$. □

6. $O(n^{-1})$ Perturbations

In this section, we will discuss examples where

$$a_n = 1 + \alpha n^{-1} + E_a(n) \tag{6.1}$$

$$b_n = \beta n^{-1} + E_b(n) \tag{6.2}$$

where $E_\cdot(n)$ is small compared to $\frac1n$ in some sense. Our main result
will involve the very weak requirement on the errors that
$n(|E_a(n)| + |E_b(n)|) \to 0$. (In fact, we only need the weaker condition that
$\sum_{j=1}^n (|E_a(j)| + |E_b(j)|)$ is $o(\ln n)$.) In discussing the historical context,
we will consider stronger assumptions like

$$E_\cdot(n) = \frac{C_\cdot}{n^2} + o\Bigl(\frac{1}{n^2}\Bigr) \tag{6.3}$$

We will also mention examples where the leading $n^{-1}$ terms are replaced
by $(-1)^n n^{-1}$.

These examples are natural because they are just at the borderline
beyond $J - J_0$ trace class or $\underline A_0(J) < \infty$ or $\bar A_0(J) > -\infty$.

Here is the general picture for these examples. The $(\alpha, \beta)$ plane is
divided into four regions:

(a) $|\beta| < -2\alpha$. Szego fails at both $-2$ and $2$.

(b) $|\beta| \le 2\alpha$. Szego holds.

(c) $\beta > 2|\alpha|$ or $\beta = -2\alpha$ with $\beta > 0$. Szego holds at $+2$ but fails at
$-2$.

(d) $\beta < -2|\alpha|$ or $\beta = 2\alpha$ with $\beta < 0$. Szego holds at $-2$ but fails at
$+2$.

Remarks. 1. These are only guidelines and the actual result that we
can prove requires estimates on the errors.

2. Put more succinctly, Szego holds at $\pm 2$ if and only if $2\alpha \pm \beta \ge 0$.

3. We need strong hypotheses at the edges of our regions where
$|\beta| = 2|\alpha|$. For example, "generally" Szego should hold if $\beta = 2\alpha > 0$,
but if $a_n = 1 + \alpha n^{-1} - (n \ln(n))^{-1}$ and $b_n = 2\alpha n^{-1}$, the Szego condition
fails (at $-2$), as follows from Theorem 6.1 below.
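
The four regions are just the sign pattern of $2\alpha + \beta$ and $2\alpha - \beta$ from Remark 2. The sketch below encodes that guideline only; the function name and the returned labels are hypothetical, and, as Remarks 1 and 3 stress, the prediction ignores the error terms and is not reliable on the boundary lines $|\beta| = 2|\alpha|$.

```python
def szego_region(alpha, beta):
    """Return the region label 'a', 'b', 'c' or 'd' for the leading coefficients
    (alpha, beta), using the heuristic 'Szego holds at +-2 iff 2*alpha +- beta >= 0'."""
    holds_at_plus_2 = 2 * alpha + beta >= 0
    holds_at_minus_2 = 2 * alpha - beta >= 0
    if holds_at_plus_2 and holds_at_minus_2:
        return "b"  # Szego holds at both edges
    if holds_at_plus_2:
        return "c"  # holds at +2, fails at -2
    if holds_at_minus_2:
        return "d"  # holds at -2, fails at +2
    return "a"      # fails at both edges


# Assumed sample points, one per region plus two boundary cases.
samples = {
    (-1, 0): szego_region(-1, 0),
    (1, 0): szego_region(1, 0),
    (0, 1): szego_region(0, 1),
    (0, -1): szego_region(0, -1),
    (-1, 2): szego_region(-1, 2),
    (-1, -2): szego_region(-1, -2),
}
```

The two boundary samples $(-1, 2)$ and $(-1, -2)$ land in (c) and (d) respectively, matching the clauses "$\beta = -2\alpha$ with $\beta > 0$" and "$\beta = 2\alpha$ with $\beta < 0$" in the list.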

Here is the history of these kinds of problems:

(1) Pollaczek [17], [18], [19] found an explicit class of orthogonal
polynomials in the region (in our language) $|\beta| < -2\alpha$, one example for
each such $(\alpha, \beta)$ with further study by Szego (but note formula
(1.7) in the appendix to Szego's book [25] is wrong; he uses in that
formula the Bateman project normalization of the parameters he calls
$a, b$, not the normalization he uses elsewhere). They found that for
these polynomials, the Szego condition fails.

(2) In [12], Nevai reported a conjecture of Askey that (with $O(n^{-2})$
errors) Szego fails for all $(\alpha, \beta) \ne (0, 0)$.

(3) Askey-Ismail found some explicit examples with $b_n = 0$ and
$\alpha > 0$, and noted that the Szego condition holds (!), so they concluded
the conjecture needed to be modified.

(4) Dombrowski-Nevai proved a general result that Szego holds
when $b_n = 0$ and $\alpha > 0$ with errors of the form (6.3).

(5) Charris-Ismail computed the weights for Pollaczek-type
examples in the entire $(\alpha, \beta)$ plane to the left of the line $\alpha = 1$, and
considered a class depending on an additional parameter, $\lambda$. While they
did not note the consequence for the Szego condition, their example is
consistent with our picture above.

In addition, we note that in [12], Nevai proved that the Szego condition
holds if $a_n = 1 + (-1)^n \alpha/n + O(n^{-2})$ and $b_n = (-1)^n \beta/n + O(n^{-2})$.
With regard to this class, here is our result in this paper:

Theorem 6.1. Suppose

$$\sum_{n=1}^\infty (a_n - 1)^2 + b_n^2 < \infty \tag{6.4}$$

$$\limsup_N \Bigl(-\sum_{n=1}^N \bigl(a_n - 1 \pm \tfrac12 b_n\bigr)\Bigr) = \infty \tag{6.5}$$

for either plus or minus. Then the Szego condition fails at $\pm 2$.

Proof. (6.5) implies that $\bar A_1^\pm(J) = \infty$ so by Theorem 4.4(ii), $Z_1^\pm(J) = \infty$. □

Remark. The same kind of argument lets us also prove the failure of
the Szego condition without assuming (6.4) by replacing (6.5) by the
slightly stronger condition that

$$\limsup_N \Bigl(-\sum_{n=1}^N \bigl(\ln(a_n) \pm \tfrac1p b_n\bigr)\Bigr) = \infty \tag{6.6}$$

for some $p > 2$. For one can use the step-by-step sum rule for
$(1 \pm \frac2p \cos(\theta))$. (6.4) is not needed to control errors in $\mathcal E$-sums since they
have a definite sign near both $+2$ and $-2$, and it is not needed to
replace $\ln(a)$ by $a - 1$ since (6.6) has $\ln(a_n)$.

These considerations yield another interesting result. One can prove
Theorem 4.1 for the weight $w(\theta) = 1 \pm \frac2p \cos(\theta)$ just as we did it for the
weight 1. Since $w(\theta)$ is bounded away from zero, the corresponding $Z$
term is finite if and only if $Z$ is. Since $p > 2$, the corresponding eigenvalue
term is finite if and only if $\mathcal E_0$ is. Using Theorem 4.1(a)-(d) for this
$w(\theta)$, we obtain

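Theorem 6.2. Let $|p| < \frac12$ and $|q| < \frac12$.

(i) If

$$\limsup_N \Bigl(-\sum_{n=1}^N \bigl(\ln(a_n) + p\,b_n\bigr)\Bigr) > -\infty$$

and

$$\limsup_N \Bigl(-\sum_{n=1}^N \bigl(\ln(a_n) + q\,b_n\bigr)\Bigr) = -\infty$$

then $Z(J) = \infty$.

(ii) If

$$\liminf_N \Bigl(-\sum_{n=1}^N \bigl(\ln(a_n) + p\,b_n\bigr)\Bigr) < \infty$$

and

$$\liminf_N \Bigl(-\sum_{n=1}^N \bigl(\ln(a_n) + q\,b_n\bigr)\Bigr) = \infty$$

then $\mathcal E_0(J) = \infty$.

In particular, if $a_n \equiv 1$, $b_n \ge 0$, and $\sum_{n=1}^\infty b_n = \infty$, we have $Z(J) = \infty$
and $\mathcal E_0(J) = \infty$. On the other hand, if instead $\sum_{n=1}^\infty b_n < \infty$, then
$Z(J) < \infty$ and $\mathcal E_0(J) < \infty$.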

Corollary 6.3. If $a_n, b_n$ are given by (6.1), (6.2) with

$$\lim_{n\to\infty} n\bigl[|E_a(n)| + |E_b(n)|\bigr] = 0 \tag{6.7}$$

and $2\alpha \pm \beta < 0$, then the Szego condition fails at $\pm 2$.

Remarks. 1. This is intended as separate results for $+$ and for $-$.

2. All we need is

$$\lim_{N\to\infty} (\ln N)^{-1} \sum_{n=1}^N \bigl(|E_a(n)| + |E_b(n)|\bigr) = 0$$

instead of (6.7). In particular, trace class errors can be accommodated.

Proof. If (6.7) holds,

$$\sum_{n=1}^N \bigl[(a_n - 1) \pm \tfrac12 b_n\bigr] = (\alpha \pm \tfrac12\beta) \ln N + o(\ln N)$$

so (6.5) holds if $2\alpha \pm \beta < 0$. □

As for the complementary region $|\beta| < 2\alpha$, one of us has proven (see
Zlatos [27]) the following:

Theorem 6.4 (Zlatos [27]). Suppose $|\beta| < 2\alpha$ and

$$a_n = 1 + \alpha n^{-1} + O(n^{-1-\varepsilon})$$
$$b_n = \beta n^{-1} + O(n^{-1-\varepsilon})$$

for some $\varepsilon > 0$. Then the Szego condition holds.

Remarks. 1. This is a corollary of a more general result (see [27]).

2. In these cases, $-\sum_{n=1}^N \ln(a_n)$ diverges to $-\infty$. This is only
consistent with (4.18) because $\mathcal E_0(J) = \infty$, that is, the eigenvalue sum
diverges and the two infinities cancel.



We can use these examples to illustrate the limits of Theorem 4.1.

(1) If $a_n = 1$ and $b_n = \frac1n$, then $Z(J) = \infty$ (by Corollary 6.3) while
$\bar A_0(J) = \underline A_0(J) < \infty$. Thus $\mathcal E_0(J) = \infty$.

(2) If $a_n = 1 - \frac1n$, $b_n = 0$, then $Z(J) = \infty$ (by Corollary 6.3),
$\bar A_0(J) = \underline A_0(J) = \infty$, but $\mathcal E_0(J) < \infty$ since $J$ has no spectrum outside
$[-2, 2]$.

(3) If $a_n = 1 + \frac1n$, $b_n = 0$, then $Z(J) < \infty$ (by Theorem 6.4), but
$\bar A_0(J) = \underline A_0(J) = -\infty$ and so $\mathcal E_0(J) = \infty$.
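
The logarithmic growth that drives Corollary 6.3 and these examples can be seen directly from partial sums. The sketch below uses hypothetical function names and assumes the pure $1/n$ coefficients $a_n = 1 + \alpha/n$, $b_n = \beta/n$ with no error terms. It evaluates the partial sums appearing in (6.5) and those defining $\bar A_0$, $\underline A_0$.

```python
import math


def a1_partial_sum(alpha, beta, sign, N):
    """-sum_{n=1}^N (a_n - 1 + sign * b_n / 2) for a_n = 1 + alpha/n, b_n = beta/n,
    with sign = +1 or -1 as in (6.5)."""
    coeff = alpha + sign * 0.5 * beta
    return -sum(coeff / n for n in range(1, N + 1))


def a0_partial_sum(alpha, N):
    """-sum_{n=1}^N ln(a_n) for a_n = 1 + alpha/n (requires 1 + alpha/n > 0 for all n)."""
    return -sum(math.log(1.0 + alpha / n) for n in range(1, N + 1))


N = 100000
# Example (1): alpha = 0, beta = 1. The A_0 sums vanish, while the "-" sum in (6.5)
# grows like (1/2) ln N, so its ratio to ln N approaches 1/2 slowly.
ratio_example_1 = a1_partial_sum(0.0, 1.0, -1, N) / math.log(N)
# Example (3): alpha = 1, beta = 0. The A_0 sums equal -ln(N + 1) and diverge to -infinity.
a0_example_3 = a0_partial_sum(1.0, N)
```

Example (2) with $a_n = 1 - \frac1n$ is omitted from the sketch because the formula gives $a_1 = 0$, for which the logarithm is undefined.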

Finally, we note that Nevai's [12] $(-1)^n/n$ theorem shows that we
can have $Z(J) < \infty$, $\mathcal E_0(J) < \infty$, and have the sums $\sum (a_n - 1)$ and/or $\sum b_n$
be only conditionally and not absolutely convergent.